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Abstract 

As part of our program of lattice simulations of three flavor QCD with improved staggered quarks, 
we have calculated pseudoscalar meson masses and decay constants for a range of valence quark 
masses and sea quark masses on lattices with lattice spacings of about 0.125 fm and 0.09 fm. We fit 
the lattice data to forms computed with "staggered chiral perturbation theory." Our results provide 
a sensitive test of the lattice simulations, and especially of the chiral behavior, including the effects 
of chiral logarithms. We find: U = 129.5 ±0.9 ±3.5 MeV, fx = 156.6 ±1.0 ±3.6 MeV, and fx/U = 
1.210(4)(13), where the errors are statistical and systematic. Following a recent paper by Marciano, 
our value of Jk/ f-K implies \ Vus\ = 0.2219(26). Further, we obtain mu/rrid = 0.43(0) (1) (8), where 
the errors are from statistics, simulation systematics, and electromagnetic efi^ects, respectively. The 
partially quenched data can also be used to determine several of the constants of the low energy 
chiral effective Lagrangian: in particular we find 2Ls — -L5 = —0.2(1) (2) x 10~^ at chiral scale 
rriri, where the errors are statistical and systematic. This provides an alternative (though not 
independent) way of estimating m„; the value of 2Ls — L5 is far outside the range that would allow 
the up quark to be massless. Results for m^^, m^^, and ms/rh can be obtained from the same 
lattice data and chiral fits, and have been presented previously in joint work with the HPQCD 
and UKQCD collaborations. Using the perturbative mass renormalization reported in that work, 
we obtain = 1.7(0)(1)(2)(2) MeV and rn|^ = 3.9(0)(1)(4)(2) MeV at scale 2 GeV, with errors 
from statistics, simulation, perturbation theory, and electromagnetic effects, respectively. 

PACS numbers: 12.38.Gc, 12.39.Fe, 12.15.Hh 
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I. INTRODUCTION 



Using lattice QCD techniques, the masses and decay constants of hght pseudoscalar 
mesons can be determined with high precision at fixed quark mass and lattice spacing. 
Assuming that the chiral and continuum extrapolations are under control, one can therefore 
calculate from first principles a number of physically important quantities, including 

• Pion and kaon leptonic decay constants, and fx, and their ratio. 

• Low energy ( "Gasser-Leutwyler" constants Lj, in particular L5, L4, and the com- 
binations 2Ls — L5 and 2Lq — L4. 

• Quark mass ratios, such as rris/rh, where rh is the average of the u and d quark masses, 
and rriu/md. 

• Absolute quark mass values, if the mass renormalization constant is known perturba- 
tively or nonperturbatively. 

The comparison of fjr and fx with experiment provides a sensitive test of lattice methods 
and algorithms. A precise determination of fx, or fx/ fn niay in fact be turned around to 
determine the magnitude of the CKM element Vus, as emphasized recently by Marciano 
P]. The quark masses are fundamental parameters of the Standard Model, and hence are 
phenomenologically and intrinsically interesting. Of special importance here is the up quark 
mass: if m„ or m^/m^ can be bounded away from zero with small enough errors, it can rule 
out niu = as a solution to the strong CP problem ^,0].^ Finally, the Gasser-Leutwyler 
parameters give a concise summary of the properties of low energy QCD. In particular the 
combination 2Lh — provides an alternative (although not independent) handle on the up 
quark mass 0, Q] . 

Extracting these important quantities is predicated on being able to control the chiral 



IS 



and continuum extrapolations. The improved staggered (Kogut-Susskind, KS) quarks 
used here have the advantage in this respect of allowing us to simulate at quite small quark 
mass: Our lowest mT^/nip value is ~ 0.3, a pion mass of roughly 250 MeV. On the other 



^ We note that Creutz [g has argued that the statement to„ = is not physically meaningful and therefore 
cannot be a resolution of the CP problem. Since we find a non-zero value for m„ here, we are not forced 
to face this issue directly in the current work. 
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hand, these extrapolations are comphcated by the fact that a single staggered quark field 
describes four species of quarks. We call this degree of freedom "taste" to distinguish it 
from physical fiavor. We simulate the latter by introducing distinct staggered fields for each 
nondegenerate quark fiavor; while we handle the former by taking the fourth root of the 
staggered quark determinant. 

The fact that taste symmetry is violated at finite lattice spacing leads to both practi- 
cal and theoretical complications. The improvement of the fermion action jsj reduces the 
splittings among pseudoscalar mesons of various tastes to 0(a|a^); yet the splittings are 
still numerically large, especially on our coarser lattices. This practical problem makes it 
impossible to fit our data with continuum chiral perturbation theory (XPT) expressions (see 
Refs. [3| and 11|, as well as discussion in Sec. llXDl) . Instead, we must use "staggered chiral 



perturbation theory" (SXPT) jl^ 13, Q, |l5|, which includes discretization effects within 



the chiral expansion. Using SXPT, we can take the chiral and continuum limits at the same 
time, and arrive at physical results with rather small systematic errors. 

Theoretically, it is not obvious that, in the presence of taste- violations, the fourth root 
procedure commutes with the limit of lattice spacing a 0. Assuming that perturbation 
theory for the standard KS theory without the fourth root correctly reproduces a continuum 
four-taste theory, then the fourth root trick is correct in perturbation theory since it 
just multiplies each virtual quark loop by 1/4. However, nonperturbatively, the fourth root 
version is almost certainly not ultra-local at finite lattice spacing, and the possibility remains 
that it violates locality (and therefore universality) in the continuum limit. We believe that 
ex.«„g cheC. y y Q of t.e fo..a,.. iain. e.pen.e,..I al.ead, 
this possibility unlikely. The current work adds more evidence that the method gives results 
that agree well with experiment and have the proper chiral behavior, up to controlled taste- 
violating effects that vanish in the continuum limit. However the question is not yet settled. 
We discuss this further in Sec. IIX D 71 and briefly refer to other recent work that addresses 
the issue. 

This violation of taste symmetry arises because the full axial symmetry (at rriq = 0) is 
broken to a single U(l) subgroup on the lattice. This means that only one of the pseu- 
doscalars, which we call the "Goldstone meson" , has its mass and decay constant protected 
from renormalization. A study of pion masses and decay constants by the JLQCD collabo- 
ration |2l| explored the masses and decay constants of all of the pseudoscalars in a quenched 
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calculation. We concentrate almost exclusively here on Goldstone mesons, thus avoiding the 
necessity for renormalization. 

We have generated a large "partially quenched" data set of Goldstone meson masses and 
decay constants using three flavors of improved KS sea quarks. These quantities have been 
computed with a wide range of sea quark masses (with m„ = m^^ 7^ ^s), and on lattices 
with lattice spacings of about 0.125 fm and 0.09 fm. We have 8 or 9 different valence quark 
masses available for each set of sea quark masses and lattice spacing. This data may be fit 
to chiral-logarithm forms from SXPT, which at present have been computed for Goldstone 



mesons only 



14L Il5 |. However, since the masses for mesons of other tastes enter into the 



one-loop chiral logarithms of the Goldstone mesons, some control over those masses is also 
needed. We have computed most non-Goldstone "full QCD" (valence masses equal to sea 
masses) pion masses on most of our lattices. We can fit that data to the tree-level (LO) 
SXPT form, and use the results for splitting and slopes as input to the NLO terms for the 
Goldstone mesons. There is, of course, a NNLO error in this procedure, which we estimate 
in Sec. IvTrI 

The outline of the rest of this paper is as follows: SectionlTTlexplains the methodology used 
to compute raw lattice results (at fixed a and fixed quark mass). In Sec. 11111 we describe the 
details of our simulations. We present a first look at the raw data in Sec. IIVI Taste violations 
are discussed in Sec. El followed in Sec. EH by a detailed description of our SXPT fitting 
forms. Relevant results from weak coupling perturbation theory are collected in Sec. I VI II 
At the current level of precision, electromagnetic and isospin-violating effects cannot be 
ignored, and we discuss the necessary corrections and the attendant systematic errors in 
Sec. IVIIII Section ITX] then presents the SXPT fits, including a description of fit ranges (in 
quark mass), an inventory of all fit parameters, the resulting fits, and a discussion of various 
issues relevant to the extraction of physical results. The discussion includes details of the 
continuum extrapolation, the evidence for chiral logarithms, an estimate of the systematic 
errors associated with using a (slightly) mass-dependent renormalization scheme, a critical 
look at the applicability and convergence of the chiral perturbation theory on our data set, 
bounds on residual finite volume effects, and some comments relevant to the fourth-root 
trick. In Sec. El we present our final results, tabulate the systematic errors, and discuss 
prospects for improving the current determinations. 
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In collaboration with the HPQCD and UKQCD groups, we have previously reported 
results for m^^, the average u-d quark mass rh^^, and rris/rh 22]. The data sets and chiral 
fits described in detail here are the same ones that were used in Ref. 12211 . 



II. METHODOLOGY 

For the axial current corresponding to the unbroken (except by quark mass) axial sym- 
metry, the decay constant fps can be found from the matrix element of ipj^ip between the 
vacuum and the pseudoscalar meson. In terms of the one component staggered fermion field 
ijj'j^ip corresponds to the operator 

Opit) = rix,t){-if+'x''{x,t) . (1) 

Here a is a summed color index. The relevant matrix element can be obtained from a 
pseudoscalar propagator using Op as both the source and sink operator: 

Ppp{t) = ^(Op(f , 0)Op{y, t)) = Cppe-'^''^^ + excited state contributions , (2) 
y 

where mps is the mass of the pseudoscalar, and K is the spatial volume. 
The decay constant is obtained from Cpp by |2lL l23l| 




fps = {mx + my)J , (3) 



where and my are the two valence quark masses in the pseudoscalar meson. Through- 
out this paper we use the convention where the experimental value of /^r is approximately 
131 MeV. Note that in computing this meson propagator we must take care to normalize the 
lattice Dirac matrix as M = am + p. The four in the denominator arises from the number 
of tastes natural to the Ko g ut- S usskind formulation. (See unnumbered equations between 



Eqs. 7.2 and 7.3 in Ref. |23|.)2 

However, the point operator Op has large overlap with excited states. For calculating 
masses it is customary to use an extended source operator that suppresses these overlaps, 



^ We thank C. Davies, G. P. Lepage, J. Shigemitsu and M. Wingate for help in getting this normaUzation 
correct. 
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together with a point sink. In our case, this extended operator is a "Coulomb wall," i.e., we 
fix to the lattice Coulomb gauge and sum over all lattice points on a timeslice: 

Ow{t) = Y^xix,t)i-lf+\{y,t) . (4) 

We can calculate propagators with any source or sink operator we wish. Ignoring excited 
state contributions, we have for example 

{Opix,0)Owit)) = Cpwe-'^-' . (5) 

We will use the shorthand "PP" for point-source point-sink propagators, "WP" for Coulomb- 
wall-source point-sink propagators, "PW" for point-source Coulomb-wall-sink propagators, 
and 'WW" for Coulomb-wall source and sink propagators. In previous calculations of 
pseudoscalar decay constants the relation Cpp = C^p/Cww has often been used to get the 
point-point amplitude. However, the wall-wall propagator has large statistical fluctuations 



and severe problems with excited states, as was discussed in Ref. j2l|. To be able to use 
the PP operator to get Cpp directly, rather than indirectly by way of the ratio formula, one 
needs much better statistics. We do this by replacing the point source with a "random-wall" 
source, which simulates many point sources. We set the source on each site of a time slice to 
a three component complex unit vector with a random direction in color space, and use this 
as the source for a conjugate gradient inversion to compute the quark propagator, whose 
magnitude is squared to produce the Goldstone pion propagator. Thus, contributions to 
a meson propagator where the quark and antiquark originate on different spatial sites will 
average to zero and, after dividing by the spatial lattice volume, this source can be used 
instead of Op. 

Figures ^ and El show masses and amplitudes from pion propagators with random-wall 
and Coulomb- wall sources and sinks. In Fig. ^ we can see that extraction of masses from the 
"WW" propagators is almost hopeless. Including an excited state helps, but statistical errors 
become very large. In Fig. |2l the WW amplitudes are also slower to plateau, though not as 
bad as the masses. As a consistency check, note that the WP and PW amplitudes are equal, 
and the masses extracted from the diagonal PP and WW propagators approach their value 
from above (since excited states must contribute to these propagators with the same sign as 
the ground state). As an additional illustration of the difficulties with using the Coulomb- 
wall — Coulomb- wall propagator. Fig. El plots the ratio of the point-point pion propagator 



(using the random wall source) to the alternative PpwPwpI Pww (with a different mass 
than in Figs. n]and|21). While this ratio is approaching one, it is clear that we would either 
need very large minimum time in the fit or a careful removal of excited states to use the 
"WW" propagators. 

Given the problems with the WW propagators, we have opted to use only the Coulomb- 
wall — point-sink and random-wall — point-sink propagators. We performed a simultaneous 
fit to these two propagators, with an amplitude for each propagator and a common mass. In 
these fits the WP propagator dominates the determination of the mass; while the amplitude 
of the PP propagator is required for computing the decay constant. Since the combination 
Cpp/m\ is needed for determining /^^ and the mass and amplitude in a fit to a meson 
propagator are strongly correlated, we used this combination as one of our fitting parameters. 
That is, we fit the point — point and wall — point meson correlators to 

Ppp = mlAppe-""-' 

PwP = mlAwpe-'^-' (6) 

so that App is the desired combination Cpp/m^. Since the correlation between ttt-tt and the 
propagator amplitude is positive, the statistical error on the quantity Cpp/ml. is somewhat 
smaller than a naive combination of the errors on Cpp and m^r. 



III. SIMULATIONS 



These calculations were made on lattices generated with a one loop Symanzik and tadpole 
improved gauge action and an order tadpole improved Kogut-Susskind quark action 



. Parameters of most of the lattices, as well as the light hadron spectrum, are in Ref. 
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Il9l |. The determination of the static quark potential, used here to set the lattice spacing, is 
presented in Ref. [3, ■ In addition to the runs tabulated in Ref. , we now have 
a partially completed run with am' = 0.005 and am'^ = 0.05. (Here and below, the primes 
on masses indicate that they are the dynamical quark masses used in the simulations, not 
the physical masses rh and m^.) In addition, we have results from two runs at a finer lattice 
spacing, a ~ 0.09 fm, with quark masses of arh'^am'^ = 0.0124,0.031 and 0.0062,0.031. 
These runs, with rh' = OAm'g and 0.2m'j,, are analogous to the coarse lattice runs with 
a7h',am'g = 0.02,0.05 and 0.01,0.05 respectively. All of these lattices have a spatial size of 
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am' 1 am'g 


10/5' 


dims. 


lats. 


am^^ 


amx 






0.03 / 0.05 


6.81 


20^ X 64 


262 


0.37787(18) 


0.43613(19) 


0.11452(31) 


0.12082(31) 


0.02 / 0.05 


6.79 


20^ X 64 


485 


0.31125(16) 


0.40984(21) 


0.10703(18) 


0.11700(21) 


0.01 / 0.05 


6.76 


20^ X 64 


608 


0.22447(17) 


0.38331(24) 


0.09805(14) 


0.11281(17) 


0.007 / 0.05 


6.76 


20^ X 64 


447 


0.18891(20) 


0.37284(27) 


0.09364(20) 


0.11010(28) 


0.005 / 0.05 


6.76 


243 X 64 


137 


0.15971(20) 


0.36530(29) 


0.09054(33) 


0.10697(40) 


0.0124 / 0.031 


7.11 


28^ X 96 


531 


0.20635(18) 


0.27217(21) 


0.07218(16) 


0.07855(17) 


0.0062 / 0.031 


7.09 


28^ X 96 


583 


0.14789(18) 


0.25318(19) 


0.06575(13) 


0.07514(17) 



TABLE I: Parameters of the simulations in units of the lattice spacing. The first four columns are 
the dynamical quark masses am'/am'^, the gauge coupling 10 /g"^, the lattice dimensions, and the 
number of configurations used in these calculations. The remaining four columns are the "diagonal" 
pseudoscalar masses and amplitudes, with valence quark masses equal to the sea quark masses. 
The masses shown here come from a separate spectrum calculation, using more source time slices 
than were used in the partially quenched calculations, and using more lattices at am' = 0.03. 
Equation 13 can be used to express these masses and decay constants in units of ri. 

about 2.5 fm with the exception of the am', am'^ = 0.005, 0.05 run, where the spatial size is 
about 3.0 fm. Table H] lists the parameters of the runs used here. 

We note here that the values for am'^ were approximately tuned from the vector to pseu- 
doscalar meson mass ratio in initial runs with fairly heavy quarks. Our best determinations 
of the physical strange quark mass at these lattice spacings turned out to be lower by 8 to 
22% (coarse) and 6 to 12% (fine) than the nominal values m^, where the range depends on 
whether or not taste-violating terms (as determined by SXPT fits) are set to zero before 
demanding that and rrix take their physical values on a given lattice. 

Pseudoscalar propagators were calculated on lattices separated by six units of simulation 
time, using two source time slices per lattice. For the coarse lattices, nine valence quark 
masses were used, ranging from O.lm^ to m^; while for the fine lattices eight masses ranging 
from 0.14m'g to were used. In all but one of the runs, the source slices were taken at 
different points in successive lattices, which leads to smaller autocorrelations than using the 
same source time slices on all lattices. The effects of the remaining correlations among the 
sample lattices were estimated in two ways. First, jackknife error estimates for the masses 
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am' / am'g 


10/5' 


Am2(4) 


A/(4) 






0.03 / 0.05 


6.81 


1.10 


1.16 


0.25 


0.15 


0.02 / 0.05 


6.79 


1.07 


1.00 


0.01 


-0.09 


0.01 / 0.05 


6.76 


1.28 


1.12 


0.30 


0.27 


0.007 / 0.05 


6.76 


1.05 


0.90 


-0.02 


-0.03 


0.005 / 0.05 


6.76 


1.06 


1.20 


-0.04 


-0.04 


0.00124 / 0.031 


7.11 


1.10 


1.13 


0.25 


0.15 


0.00062 / 0.031 


7.09 


1.10 


0.95 


0.22 


-0.01 



TABLE II: Estimates of the effects of autocorrelations. Am^(4)/Am^(l) is the ratio of error 
estimates for the squared pion mass between jackknife estimates with a block size of four and a 
block size of one. A/^(4)/A/^(l) is the Scime thing for tli6 decciy constcint. '^int^m 

and Tintj are 

the integrated autocorrelation times for the squared pion mass and decay constant. All of these 
numbers are averaged over the valence quark masses. 

and decay constants were made eliminating one lattice at a time, and again eliminating four 
successive lattices. Secondly, an integrated autocorrelation time was estimated by summing 
the autocorrelations of the single elimination jackknife results over separations from one to 
five samples (six to thirty simulation time units) r^nt = Xli where Ci is the normalized 
autocorrelation of jackknife results omitting lattices separated by 6i time units. The error 
estimate including the effects of autocorrelations is a factor of ^Jl + Tint larger than the 
error from the single elimination jackknife fit. Table ITTl summarizes the results of these tests. 
The numbers in Table ITTl varv a lot, consistent with the well known difficulties in measuring 
autocorrelations on all but the longest runs. Since we actually expect the autocorrelations to 
be smooth functions of the quark mass, we account for them by increasing all the elements 
of the covariance matrix by an approximate average of these factors squared, (1.10)^, which 
is equivalent to increasing error estimates by a factor of 1.10. 

Propagators were fit to Eq. El using a minimum time distance of 20a for the coarse lattices 
and 30a for the fine lattices. At these distances, the contamination from excited states is at 
most comparable to the statistical errors. For example. Fig. 01 shows results for pion masses 
and amplitudes as a function of minimum fitting distance for one of the fine runs. Since our 
other systematic errors are significantly larger than statistical errors (see Sec. |X|), we can 
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neglect the systematic effect due to excited states. 

For each run, the propagator fitting produced a pion mass and decay constant for each 
combination of valence quark masses. We call the two valence quarks in a particular meson x 
and y\ there are 45 different combinations of mx^uiy for the coarse lattices and 36 for the fine, 
although, as described in Sec. IIX A[ the largest valence quark masses were not used in all of 
the fits. All of the masses and decay amplitudes from a single run are correlated. For each run 
with N samples, a covariance matrix describing the fluctuations of all of these numbers was 
made by doing a single elimination jackknife fit, omitting one lattice at a time, and rescaling 
the covariance matrix of the jackknife fits by (A^ — 1)^. A single elimination jackknife, rather 
than one where larger blocks were omitted, was used because getting a reliable covariance 
matrix requires a number of samples large compared to the dimension of the matrix. Then, 
to account for autocorrelations, this covariance matrix was rescaled by the factor estimated 
above. Finally, to allow simultaneous fitting of the meson decay constants and masses from 
all of the runs as a function of valence and sea quark masses, the covariance matrices from 
the individual runs were combined into a large block-diagonal covariance matrix. (Runs 
with different sea quark masses or gauge couplings are independent, so correlations between 
different runs can be set to zero.) 

Fitting the pseudoscalar propagators produces masses and decay constants in units of the 
lattice spacing a, and to convert to physical units we must estimate a from a calculation of 
some dimensional quantity whose value is known. This amounts to saying that we are cal- 
culating ratios of these quantities to some other quantity calculated from these simulations. 
We express our results in units of a length obtained from the static quark potential, ri, 
where r\F{ri) = 1.0 25|,|26|. This has the advantage that ri can be accurately determined 
in units of the lattice spacing. But ri is not a directly measurable quantity, and its physical 
value must in turn be obtained from some other quantities. We have calculated the static 
quark potential in all of these runs, and fit it to determine ri/a. To smooth out statistical 
fluctuations in these values, we then computed a "smoothed ri" by fitting the ri/a values 
to a smooth function. A simple form, which gives a good fit over the range of quark masses 
and gauge coupling used here, is [3| 

log(ri/a) = Coo + C,o{10/g' - 7) + Coiamt,, + C^oilO/g' - 7f , (7) 
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where rritot = 2m' + m'^. The results of the fit are 

Coo = 1.258(3) Cio = 0.937(9) 

Coi = -0.83(3) C20 = -0.27(2) (8) 

When we need an absolute lattice scale, we start with the scale from T 2S'-1S' or IP-IS 
splittings, determined by the HPQCD group This gives a scale a ^ = 1.588(19) 



GeV on the coarse 0.01/0.05 lattices, and a'^ = 2.271(28) GeV on the fine 0.0062/0.031 
lattices. For light quark masses <ms/2, the mass dependence of these quantities and of ri 
appears to be slight, and we neglect it. With our smoothed values of ri/a, we then get 
Ti = 0.324(4) fm on the coarse lattices and ri = 0.320(4) fm on the fine lattices. 

To extrapolate ri to the continuum, we first assume that the dominant discretiza- 
tion errors go like a^a^. Using av{q*) I28] (with scale q* = it /a) for as gives a ratio 
(Q;5a^)fine/(a5a^)coarse = 0.427. Extrapolating away the discretization errors linearly then 
results in ri = 0.317(7) fm in the continuum. However, taste- violating effects, while formally 
(9(a|a^) and hence subleading, are known to be at least as important as the leading errors 
in some case. Therefore, one should check if the result changes when the errors are assumed 
to go like a^a^. Taking 0:5 = av(3.33/a) gives a ratio (a'^gO^) Rne / {c^s^"^) coarse = 0.375; while 
a direct lattice measurement of the taste-splittings gives a ratio of 0.35. Extrapolating lin- 
early to the continuum then implies ri = 0.318(7) fm or ri = 0.319(6) fm, respectively, in 
agreement with the previous result. For our final result, we use an "average" ratio of 0.4 
and add the effect of varying this ratio in quadrature with the statistical error. We obtain 
ri = 0.317(7) fm. A systematic error of 0.03 fm in ri from our choice of fitting methods is 
omitted since it is common to all our runs and cancels out in the final results here. Using 
our current value ro/ri = 1.472(7), the result for ri implies ro is about 7% smaller than 
the standard phenomenological choice ro = 0.5 fm, although the difference is within the 
expected range of error of the phenomenological estimates 



IV. FIRST LOOK AT RESULTS 

FiguresElandinipresent pseudoscalar masses and decay constants in units of ri as functions 
of the valence quark masses for several different light quark masses. All of these points are 
from the lattices with a ~ 0.125 fm. Figure also contains pion masses where the sea quark 
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mass varies along with the valence quark masses. 

Figures [7| and |H1 show the effect of changing the lattice spacing. For lattice spacings 
a ~ 0.125 fm and a ^ 0.09 fm we show results with m' = 0.4m'^ and rh' = 0.2m'g, again in 
units of ri. The horizontal axis is again the sum of the valence quark masses in the meson. 
These figures also show a crude extrapolation to a = 0, made by taking a linear extrapolation 
in aso^ using pairs of points with the same m! /m'^. In Fig. [7| one pair of extrapolated points 
has diagonal lines showing the data points that were extrapolated to produce this point. In 
hindsight, m'^ used in the a ~ 0.09 fm runs was smaller than that used in the a ~ 0.125 
fm runs, as indicated by the fact that the finer lattice points fall slightly to the left of the 
corresponding coarse lattice points. 



V. TASTE SYMMETRY VIOLATIONS 



As mentioned above, we use the term "taste" to denote the different staggered-fermion 
species resulting from doubling. At finite lattice spacing, taste symmetry is violated. Al- 
though the improved staggered action reduces the taste violating effects to (9(a|a^) from 
0{asa?) with unimproved staggered fermions, the violations are still quite significant nu- 
merically. 

Figure ini shows the splittings between pions of various tastes on our coarse lattices. There 
are 16 such pions, vr^, where B = 5, fi5, fiu, fi, I (z/ > fi) labels taste matrices in the taste 
Clifford algebra generated by Euclidean gamma matrices The tt^ is the Goldstone (pseu- 
doscalar taste) pion, whose mass is required to vanish in the chiral limit by the exact (non- 
singlet) lattice axial symmetry. All the pions in Fig. El are fiavoi- charged, i.e., 7r+ mesons. 
Thus there are no contributions from disconnected graphs, even for the taste-singlet nf. 



The approximate "accidental" SO (4) identified by Lee and Sharpe [l^ is clearly a good 
symmetry: there is near degeneracy between and n^, between and n^j, and between 
TTfj" and ir^. When we assume such degeneracy, we can think of the index B as running over 
the multiplets 5, A,T,V, I with degeneracies 1, 4, 6, 4, 1, respectively. 
The fit in Fig. El is to the tree- level chiral form given in Refs. jl2. 

ml+ = 2fitreerh + o^Ab . (9) 

The slope, /itree? is the same for all tastes, but there are constant splittings for each non- 
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taste (B) 


^1 ( ^ B ) coarse 


(a^As)flnc 
((2*^ )coarsc 


A 


0.205(2) 


0.344(23) 


T 


0.327(4) 


0.353(18) 


V 


0.439(5) 


0.347(22) 


I 


0.537(15) 


0.384(33) 



TABLE III: Mass-squared splittings in units of ri for the coarse lattices, and the ratio of fine to 
coarse splittings. Results from tastes that are degenerate under the accidental 50(4) have been 
combined. 



Goldstone multiplet (A5 = 0). Although the fit is poor (chiral logs, including taste- 
violations, are needed), it does give the pion squared masses within a few per cent: The 
biggest deviation, 7%, is for the Goldstone pion at the lowest mass; most other deviations 
are ~2%. 

Table UTTl shows the values of a^A^ coming from the fit on the coarse lattices. On the fine 
lattices, we have measured non-Goldstone pion masses only on the set with quark masses 
0.0124, 0.031. So we directly compare the splittings with those of the corresponding coarse 
lattice (masses 0.02, 0.05). The fine-lattice splittings are smaller by a common factor of 0.35, 
within errors. This is consistent with the expectation that taste violations go like (9(a|a^). 
Indeed, if we take as = c^viQ*) IsSl and choose q* = ir/a because taste violations occur at 



the scale of the cutoff, we find 

jfine 



{ay{q*=n/a) a^^ 



(ay(g*=7r/a) a'^)coiirse 



0.372 . (10) 



The ratio of taste- violating terms between fine and coarse lattices is an input to the chiral 
fits for Goldstone pions discussed below. The measured splitting ratio of 0.35 is used as a 
central value. The error can be estimated by varying q* in Eq. (fTUI) : q* = 71/ {2a) gives 
a ratio of 0.324; while 27i/a gives 0.398. We take 0.3-0.4 as an appropriate range for our 
analysis of systematics. 

We warn the reader here that the notation in Eq. (jU)) can be slightly misleading. We 
have shown explicitly the factor in the taste- violating splitting, o^Ab, but this does not 
mean that A^ itself is independent of lattice spacing, or even that it approaches a non-zero 
constant in the continuum limit. Indeed, the argument above implies that A^ is a slowly 
varying function of a that goes like ay{7f/a) for small a. A similar comment applies to the 
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other taste- violating parameters introduced in Sec. IVI Al the a? dependence is always shown 
explicitly, but dependence on a through the coupling is hidden. 

In physical units, the splittings on the coarse lattices are quite large. The largest is for the 
taste-singlet pion: a^A/ (450 MeV)^; while the smallest, for the taste axial-vector pion, 
is a^A^ (280 MeV)^. Given the size of these splittings, which are discretization errors, 
it is not surprising that the lattice data is not well fit by continuum chiral perturbation 
theory (XPT) forms. Figure ^1 shows such an attempted fit for the Goldstone f-^ to the 
standard NLO partially quenched continuum form j29| plus analytic NNLO terms. More 
details about this fit will be explained below, when we discuss the corresponding fits that 
take into account taste violations. For the moment, we simply remark that the minuscule 
confidence level (CL ^ 10"^^'^; x^/d-o.f. = 8.77 with 204 degrees of freedom) shows how 
hard it is to ignore lattice artifacts at the level of chiral logarithms. 



VI. STAGGERED CHIRAL PERTURBATION THEORY 



Lee and Sharpe |12| found the chiral Lagrangian that describes a single staggered field. 
Their Lagrangian includes the effects of taste violations at 0{a?) as well as the standard 
violations of chiral symmetry from mass terms at 0{mq), where nig is a generic quark mass. 
They introduced a power counting that considers rriq and to be of the same order, which is 
appropriate here: In Fig. IHlthe splittings are comparable to the squared meson masses. Tree 
level (LO) is thus 0{mg, a^); chiral logs appear at one-loop (NLO) and are 0{rn?, mgO^, a'^). ^ 



The Lee-Sharpe Lagrangian is not directly appropriate to the calcu 



ations here because 



15| have generalized 



it has only one flavor (one staggered field). Aubin and Bernard 

Ref. Q to n staggered flavors and shown how to accommodate the v^Det trick in loop 
calculations. This is what is meant by "staggered chiral perturbation theory," SXPT. 

Continuum chiral perturbation theory can be thought of as an expansion in the dimen- 



^ Throughout this paper, we define the order of a contribution to be the order of the corresponding term 
in the chiral Lagrangian. This is the simplest way to keep the power counting consistent between decay 
constants and meson masses, although it does lead to the unnatural statement that the tree-level is 
"0(771^)" since it comes from the kinetic energy term in the chiral Lagrangian. What matters ultimately 
is only the relative size of contributions: the first correction to the tree- level value of or /^r is smaller 
by one power of mq. 
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^9 — o^2/2 • (-'■-'■) 



sionless quantity 

8^ 

where 2^mq is the tree-level mass of a qq meson. For physical kaons, we expect the relevant 
quark mass parameter to be Xud,s = {Xud + Xs)/2 ~ 0.18 (where Xud is the average value for 
the u and d quarks); this is reasonable given the experimental result fK/f-K ~ 1.22. 

Staggered chiral perturbation theory is a joint expansion in Xq and Xa?^ which measures 
the size of the O^o?) taste violations: 

= 8^ ' ^''^ 
where a? A is a "typical" taste-violating term. Taking for a^A the average meson splitting 
(see Eq. (PHj) below), we have a?/S. ^ (350 MeV)^ and Xa^ ~ 0.09 on the coarse lattices; 
a^A (200 MeV)^ and Xo? ~ 0.03 on the fine lattices. If one instead uses the larger of 
the 0{a^) taste- violating hairpin parameters |l4, 15|, a^(5^, to estimate A and Xa^i o^i^ gets 
slightly smaller values. 

A. NLO forms 

One-loop chiral logs and analytic terms have been calculated in SXPT for Goldstone 
meson masses [l^ and decay constants [l^. Partially quenched results are included, so all 
;he numerical data are available. 



forms needed to fit t 

Q 



References [14, 



15l | express the chiral logarithms in terms of 



2 

m 



^{m^) = j^ln^ + 5i(mL)J (13) 

i{m^) = + 5;{mL) , (14) 

where Av is the chiral scale, and L is the spatial dimension. The finite volume correction 

n 

terms 6i and ^3 are [13|| 

SsimL) = 2^i^o(|r1mL) , (16) 

where Kq and Ki are Bessel functions of imaginary argument, and r, which labels the various 
periodic images, is a three-dimensional vector with integer components. We have assumed 
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here that corrections due to the finite time extent are neghgible; this is true for our lattices, 
for which the time dimension is between 2.7 and 3.4 times greater than the spatial dimension. 
The function C.{m?) in Eq. (|13|) arises from tadpole diagrams with a single meson propagator; 
[{m^) in Eq. (jl4j) comes from double-poles, which are present only in the partially quenched 
(and quenched) cases, not in the full QCD limit. In practice, we compute the sum in Eq. (|15p 
or Eq. ()16p with cutoff \r\ < N, where is an integer, and increment by 1 until the sum 
changes by a fractional amount < e. To be conservative, we take e = 10~^ for central-value 
fits. However, a much weaker criterion, e = 0.001, is adequate to reduce the error in the 
sum well below our statistical errors, and we often use the weaker criterion for alternative 
fits in the systematic error estimates. 

In the generic case relevant to our data (m„ = = m 7^ and no degeneracies 
between valence and sea quarks), the NLO SXPT expressions for a meson P composed of 
valence quarks x and y are jl4 , Q 



NL0^2 



(m^ + TUy) 



j j / 



16/x 



tree 



NLO 




(2^8 - L5) (m^ + ruy) + 



32/i 



tree 



{2Lq — L4) (2m + rris 



1 



Q,B 



j 



j 3 J 



(17) 



,[3] 



[3,2] 



,[3] 



,[3] 



[3,2] 



,[3] 



+2 5^i?f 1({A^!JW})^K) ) +[v^a)+ a\L" - L') 



8/itree j , , x , 16/itrce r /r, - , 

-L5 (m^. + my) H — — L4 (2m + m^ 



P 



P 



(18) 
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Here fi and / are the continuum chiral parameters, and 5^ are LO taste-violating param- 
eters (hairpins), Li are the NLO Gasser-Leutwyler [l] coefficients, and L' and L" are hnear 
combinations of the taste-violating NLO coefficients. The reason for using the tree-level 
Utree parameter from Eq. © in the Lj terms will be explained in Sec. IVl Bl X= and are 
flavor-neutral mesons of taste S made of x, x and y, y quarks, respectively, and D^, and 
S-E are corresponding flavor-neutral mesons made from u, d, and s sea quarks, respectively. 
The index Q runs over the 6 mesons made from one valence and one sea quark, and B runs 
over the 16 meson tastes. The residues i?^"'^' and -Oj"'^' in Eqs. (fTTj) and (fTHj) are defined in 
Refs. Q,ll3. For completeness, we quote them here: 

Rf'"^ {{M}M) ^ na=l(/^a-^|) _ ^^g^ 



{{MYM) ^ -^rV^ • (20) 

Each of these residues is a function of two sets of masses, the "denominator" set {M.} = 
{1711,1712, ... ,mn} and the "numerator" set {fi} = {yUi, /i2, . . . , /Xfc}. The indices j and i, 
^ ^ ^ n, refer to particular denominator masses; the prime on the product in the 
denominator of Eq. (fT^ means that i = j is omitted. 

In Eqs. fll7|) and p8p. the denominator mass-set arguments are shown explicitly; they are 



{Ml.} 


= {mxj,m^j} , 




= {mYj,mnj} , 




= {mxj,mYj,mrjj} , 


{M'l} 


= {mxy,mr,^,m^i^} , 


{M'l) 


= {mYy,mrjy,mr,'^} , 




= {mxv,mYy,mr^y,mrf^} 



(21) 



The index j in Eqs. (|17|) and (|18p is summed over the denominator masses. Sets for axial- 
vector taste {A) are found from the corresponding vector taste {V) sets by taking V ^ A 
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in Eq. (j^U- The masses m.^^, m^^, m^' are given by 



2 "^i/, , 2m| 

= I (ml, + m% + + Z ] ; (22) 



The numerator mass-set arguments of the residues in Eqs. (|T7jl and ()18j) are not shown 
exphcitly because they are always 

{/i|l} = {mt;,,msj , (23) 

where the taste label S is taken equal to the taste of the denominator set. 

Degeneracies among the various masses in Eqs. (fT7j) and (fTBj) occur quite often in our data 
set. In particular, "partially quenched pions" have rrix = my and hence mxg = for each 
taste B. Similarly "partially quenched kaons" have rriy = and hence myg = rngg. Going 
to full QCD introduces additional degeneracies irtx = fny = mu (for pions) or mx = ttlu 
(for kaons). Further, the accidental degeneracy rriYj = m^j appears in our data when 
amy = 0.04, am' = 0.02, am'^ = 0.05 (coarse) or amy = 0.0248, am' = 0.0124, ami = 0.031 



(fine). Formulas for many of these degenerate cases appear in Refs. |l4l lisjl. For the other 
cases, one can carefully take appropriate limits in Eqs. (fTTj) and (fTHj) . or, more conveniently, 
return to the orig 
momentum integrations. 

Because Eqs. (ITTjl and (fT^ are quite complicated, it is useful to write down a simple 
result that shows more clearly how taste violations change the continuum chiral behavior. 
The pion decay constant in full QCD with two (degenerate) flavors {m^ = my = mu = = 
rh, with the strange quark integrated out) is particularly simple. In that case, the result 
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corresponding to Eq. (fTSj) is 



\ B 



-4 (^{mi^,^) - iirni^)] - 4 ( ^(mj, ) - £(m^J ) + a\L" - L') 

+^L5(2m) + ^L4(2m)| , (24) 
with B running as usual over the 16 possible tastes, and 

2 2 2 

ng Ub 1 

2 2 I 1 2r/ 

m\ = ml^ + . (25) 

In Eq. fl24j) . the term multiplied by —2 gives the average of all tastes and becomes the 
standard SU{2)l x SU{2)r chiral logarithm in the continuum limit, when all tastes are 
degenerate. The terms multiplied by —4 clearly vanish in the continuum limit because 
m^^ = m^i^ and m^r^ = m^^ when a?5'y = = a^5^. 

Since m' is significantly less than m'^ for many of our runs, Eq. (j21I) is often not a 
bad approximation to the chiral behavior of our (full QCD) data. It will be useful in the 
discussion of finite volume effects in Sec. IIX D 61 

isl ] explicitly include in the chiral Lagrangian the effects 
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We note here that Refs. 
of terms in the 0{a?') staggered- quark Symanzik action that violate the taste symmetries. 
There are also "generic" 0{a?) terms in the Symanzik action that have the same symmetries 
as the continuum QCD action and are not included explicitly. An example is o?il)D'^D^{'-)^® 
I)ip, where 7^ and / act on spin and taste indices, respectively. The effect of such terms 
on the chiral Lagrangian is to produce O{o?) variation in physical parameters such as /, /x, 
and, at higher order in m^, L,. We build the possibility of such generic variation in physical 
parameters into the chiral fits below. ^ Since our staggered action is tadpole improved, 
we expect such generic variation to be of size a^a^AqcD ~ 2%. When we extrapolate the 
physical parameters to the continuum, we will need to know how aso? changes from the 



^ Before comparing its values for different lattice spacings, the parameter must be renormalized by the 
inverse of the mass renormalization constant. See Sec. IVIll 
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coarse to fine lattice. As in the case of taste violations, such discretization errors occur at 
the scale of the cutoff. Therefore, we use as = (y.y{q* = -n/a) for central values, and allow 
q* to vary between vr/ (2a) and 27r/a for the error estimate. We have 



(av(g*=7r/a) a^) 



fine 



0.427 



(26) 



(ay(g*=7r/a) a^)coa.rs~ 
and a range for this ratio of 0.398 to 0.441. 

As they stand, Eqs. ()17|) and pSj) are slightly inconvenient because the renormalization 
of the O{mq0^) analytic NLO parameters L' and L" under a change in the chiral scale A^ is 
complicated and involves the physical Lj parameters. This is due to the fact that the meson 
masses multiplying the logarithms include O(a^) splittings. It is more natural, therefore, 
to redefine the V and L" by associating particular 0{a^) terms with the Lj. We make the 
replacements 



16/Xt 



P 

32/itr( 

~1 



{2Ls - L^){m^ + niy) 
2 (2L6 - Li) (2m + nis) 

S/^tree j , , \ 

— (m^ + my) 

16/Xtree r - i ^ 

— — — L4 (2m + rris) 
I 

where a^A/ is given by Eq. 0, and 



16 

— (2^8 - L5) fitme{m^ + my) + a^Ai 
32 



— (2^6 - L4) /itree(2m + TUg) + i^a Aj 



p 

16 

p 



/itree(2m + TUs) + f a^A 



a^Aa, = — (A5 + 4A^ + 6At + iAy + A,) 
16 



(27) 



(28) 



is the average splitting. After these redefinitions, a change in A^ renormalizes L' according 
to: 

L'(A,) = L'(A,) + 2{S'^ + S'y) H~K/K) 5 (29) 

while L" is independent of scale. From this we would expect that L' is comparable in size 
to 6'y^ + S'y, an expectation that is borne out by the fits. The renormalize by: 



L,(AJ =L,(AJ + 



(30) 



with 



2Cr — Ca 



- -1 

-2/9 



C5 = -3 ; 
2Cs — C5 



4/3 . 



(31) 
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B. NNLO Terms 



As we will see below, the high statistical precision of our data requires us to go beyond 
the NLO formulas, even for subsets of the data that include only the lighter valence quark 
masses. We include explicitly all NNLO physical analytic parameters, i.e., all analytic terms 
of 0{m^). There are five such terms for mp+ and an additional five for fp+ ^|. Expressed 
in terms of Xq defined in Eq. they are given by: 

= ^(l + NLO + (x. + Xyf + i2xud + Xsf + /^f^ (X. + Xy) {^Xud + Xs) 

+ ix. - Xyf + /^r^ (2xL + X^) ) , (32) 
^NNLO ^ J A + NLO + (x. + Xyf + (2x«d + Xs)' + P'i^ {X. + Xy) {'^Xud + Xs) 



+ ix. - Xy)' + /^F^ (2xL + X's)), (33) 

where "NLO" denotes the lower order contributions, (the corrections to the leading "1" in 
Eqs. (II 7j) and (|18|). with the substitutions in Eq. (j27p ). The interchange symmetries among 
valence quarks x ^ y and sea quarks u ^ d ^ s ^ u restrict the form of the NNLO 
corrections. These terms were obtained independently in Ref. js^. 

Possible analytic taste-violating terms at NNLO of (9(m^a^) are included implicitly by 
allowing the Lj {0{m'^) terms) to vary with lattice spacing.^ However, such variation can 
be caused either by taste-violating terms in the Symanzik Lagrangian or by terms with 
the same symmetries as the continuum operators but with explicit factors of {i.e., by 
generic discretization errors on the Li). As explained above, the generic discretization errors 
are expected to be ~ 2%; while the new taste-violating terms could change the apparent 
value of Li between coarse and fine sets by order x'^J^^^'^^ — X^i'^ ~ 6%. For our preferred 
fits, we use Bayesian priors ^ to restrict the differences in the Lj on coarse and fine 
sets to be at most 7.5% (for a 3-cr variation); while in alternative fits used to assess the 
systematic error, we relax or tighten this restriction. (See Sec. IIXBI ) Note that when we 
extrapolate the Li to the continuum, we have no a priori way to distinguish variation as 
asd"^ (generic discretization errors) from variation as a|a^ (taste violations). Therefore, we 



It is not hard to show that allowing the Li to vary with a generates all possible O(m^a^) contributions to 
masses and decay constants of Goldstone pions at rest. However, it is at this order that Lorentz- violating 
terms can affect the Goldstone pions so one would expect to find slight differences at fixed lattice 
spacing between masses and decay constants calculated here and those for pions of non-zero 3-momentum. 
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consider both types of variation {i.e., Eqs. ()26|) and ()1U|). as well as the ranges of these 
ratios) and include the difference in the systematic error. In practice, these alternative fits 
and these assumptions in how the Li are extrapolated to the continuum contribute only a 
small fraction of the total systematic error. (See Table rVTTl ) 

At NNLO, 0{mqa^) analytic terms may also affect m? + and /„+. We neglect such terms, 
which would make contributions similar to that of V and L" in Eqs. fll7|) and (fTHj) . but 
multiplied by instead of o?. Because Xa^ is only 0.09 in the worst case, we would expect 
the error thereby induced to be at most x^a? ~ fact, since we consider a generous 

range for how taste-violating a? terms may vary as we go from coarse to fine lattices (see 
discussion immediately following Eq. (HHI)), the effects of OimqO^) analytic terms should 
already be included in our systematic error estimates. 

The final possible NNLO terms in the Lagrangian are 0{a^). However, it is easy to see 
that such terms do not contribute to and fp+. The Goldstone theorem requires that 
be proportional to at least a single power of quark mass (m^ + iriy in this case) , and 
terms in the Lagrangian must have at least two derivatives to make analytic contributions 
to fp+ through Noether's theorem or wave-function renormalization. 

In addition to analytic terms at NNLO, there are, of course, NNLO chiral logarithms 
(from 2-loop graphs, as well as 1-loop graphs that involve NLO parameters). These non- 
analytic terms have not been calculated in SXPT, but in any case are not expected to be 
important here: Wherever the quark masses or splittings are large enough for the analytic 
NNLO terms to be significant, the NNLO logarithms should be slowly varying and well ap- 
proximated by analytic terms. As discussed in Sec. lIXBj the NNLO terms make a difference 
primarily in the interpolation around m^, not in the extrapolation to rh. The systematic 
errors inherent in our treatment of the NNLO terms are estimated by varying the masses 
we fit to and the Bayesian priors governing these terms and their changes with a, as well as 
by adding still higher (NNNLO) terms. 

There are also NNLO effects induced by the ambiguity in the parameters one puts 
into NLO expressions. In particular, we have at present expressed the "chiral coupling," 
l/(167r^/^) in Eqs. p7|) and (fTHj) . in terms of bare (tree-level) parameter /. Replacing / 
with the experimental value of f.,^, say, would generate a difference at NNLO. As we discuss 
below, the difference between /^r and / is significant: 13%. If we had the full NNLO 
expression, including 2-loop effects, then the ambiguity would be resolved up to terms of 
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NNNLO. But in the present case there is no a priori way to decide this issue. 

We argue, however, that putting a physical parameter in the chiral couphng (/ /tt) 
is hkely to result in a better convergent XPT. This is similar to the argument for using a 



24| . In practice. 



physical, rather than bare, coupling in weak-coupling perturbation theory 
we consider three versions of the fits: 

(1) Fix coupling as l/(167r^/^). 

(2) Leave coupling as l/(167r^/^). 

(3) Write coupling as uj / {IGn"^ f^) and treat uj as an additional fit parameter: either 
allow it to vary freely or force it to vary around 1 using Bayesian priors. 

Good fits are possible with all three choices. Both because of the argument above, and 
because it guarantees that the NLO chiral logarithms for very light quarks have the expected 
weight relative to the tree level terms, we take choice (1) for our central values. Choice (3), 
with its extra parameter, results in the highest confidence levels of the three. When u is 
allowed to vary freely, its value decreases as higher quark masses are included in the fits, 
reaching u ~ 0.6 by set // (see Sec. IIX A]) . This is similar to replacing / fx, perhaps not 
surprising for fits that must cover a range of valence masses up to a large fraction of m^. 
But for the quantities computed here, all of which are sensitive to the chiral behavior at low 
quark mass, we do not include fits with u free since we expect 1/ (IGvr^/^), not 1 / {IGn^ f]^) , 
to be the correct weight for the logarithms in the low mass regime. We still allow fits 
with range u = 1.0 ± 0.1 because 10% is roughly the difference between the physical 
and its value in the chiral limit. As discussed in Sec. |3 the main effects of including fits 
with arbitrary u would be to increase the systematic error in by about 1 MeV (with a 
corresponding effect on fx/ fix) and to double the simulation error on iriu/md (which error, 
however, is small compared to unknown electromagnetic effects). 

Good fits with choice (2) require f > fx (equivalent to ~ 0.6 in choice (3)) and quite 
large NNLO terms. In addition, the O(a^) NLO parameter a^L" becomes unreasonably 
large (~ (630 MeV)^ on the coarse lattices). For these reasons we exclude choice (2) fits 
from the systematic error estimates; including them would increase systematic errors by 
amounts comparable to those of arbitrary u fits. 

Similar considerations apply to the parameter /x. In the analytic terms involving the Li in 
Eqs. ffT7|) and (fTH|) (and hence in Eqs. (fH^ and ((221) ), we argue that it is best to put in /^tree 
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from the linear (tree-level) fits, Eq. The Li are then multiplied by actual squared meson 
masses (within the errors of Eq. Q). This corresponds to how such terms are interpreted in 
continuum XPT analysis (see, e.g., Ref. Q]). An alternative, a posteriori, choice would be 
to use the chiral limit of m^/(2m) coming from the full NNLO fit. This would replace //tree 
in Eqs. (|17j) and (jl8|) by a number 5% smaller. Since Eq. Q gives a maximum of 7% errors, 
we choose that larger value as the systematic effect. It would however be unreasonable to 
replace /itree in Eqs. (fTTj) and (fTH|) or (Eqs. (jH^ and ((221)) by the fit parameter fi itself. That 
is because the effective value of fi is corrected by terms involving that do not go away 
in the chiral limit for the light quarks. Indeed, one might expect corrections of at least 
Xud,s + xid s ~ 20%. In practice the fit parameter /x from our preferred NNLO fit on the 
intermediate valence mass set (subset //, Sec. ll-X Al is 29% less than fitree, which means that 
fi{mx + rriy) is significantly less than our measured value of m? + = ^treeij^x + t^v)- This 
difference improves to 13% in the fit to the lightest masses. 

As discussed in the introduction, all meson masses appearing in the NLO chiral logarithms 
in Eqs. (fTTjl and ()18|1 are similarly evaluated using the previously determined values of the 
taste splittings, o^Ab, and /itree from the fit of our "full QCD" data for all meson tastes 
to Eq. Q. In our results for masses and decay constants, the NNLO error introduced by 
this procedure is negligible. That is because the effect of the small errors in masses in the 
chiral logarithms on our extrapolated values is almost completely canceled by the effect of 
variations of the analytic parameters in the fit. We can check this by replacing /itree in the 
fit by the (5% different) chiral limit value; the effect is about 0.2% on quark masses and 
less than 0.1% on the decay constants, in both cases much less than the total systematic 
error. For the Lj, changing the value of /i in the chiral logarithms does not completely 
cancel the effect of changing its value in the analytic terms, but there is some cancellation. 
Therefore the 7% systematic effect in the Li discussed in the previous paragraph remains a 
conservative estimate of the error. 
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C. NNNLO terms 



We sometimes add some NNNLO terms of the following form: 

,NNNLO\2 



— = /.(l + NLO + NNLO + p("^) (x. + Xyf ) , (34) 
^NNLO ^ J + NLO + NNLO + p(^) (x. + Xyf), (35) 

where NLO and NNLO represent the contributions from Eqs. (fT7j) . (fTH|) . (j!?^ and (jSHI)- 
Since there are, of course, many additional NNNLO terms, it is nonsystematic to include 
only one each for mp+ and fp+. However, we pick these terms involving valence masses 
because there is a steeper dependence on the valence masses than on the sea quark masses. 
For lower quark masses, where we expect XPT to work well, we fit to Eqs. and (jH^I) only 
to estimate systematic errors due to the truncation of XPT. When the fits include valence 
masses equal to or greater than m^, we also use Eqs. fl34j) and in order to improve the 
interpolation around the strange quark mass. In the former case, we find that the values 
of Ip*--^^! and |p*-™^| coming from the fits are typically less than 0.1; in both cases they are 
always less than 0.2 (including when we fit to Eqs. (jHUI) and (jTTj) — see Table HV]) . 

Another form, used only for interpolations around the strange quark mass, adds on the 
square of the NLO term as a mock-up of the effect of 2-loop chiral logarithms: 

(^NNNLO')2 

= p( 1 + NLO + NNLO + (NLO)' + p(™) (x. + Xy)' ) , (36) 

(m^. + my) \ J 

fNNNLO' ^ J + NLO + NNLO + a(^) (NLO)' + p(^) (x. + Xyf ) , (37) 

where again NLO and NNLO represent the contributions from Eqs. ()17p. (fTHj) . ()32j) and 
fl33|) . The absolute values of the new coefficients o"'-'"^ and cr*--^-' in the fits are never greater 
than 0.14. 



VII. PERTURBATION THEORY 



Because the axial current we use to compute decay constants is partially conserved, there 
is no need for current renormalization. Mass renormalization is however needed to find 
continuum (MS) quark masses, as discussed in Ref. 2^. Let be the mass renormalization 
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factor that connects the bare lattice mass (am)o and the MS mass at scale A:^ 

(am)o 



auQp 



(38) 



Here, unlike in Ref . Qf^ , we have shown explicitly the plaquette tadpole improvement factor 
Mop, necessary because the MILC improved staggered action defines the lattice quark mass 
in a somewhat unconventional manner. 

The renormalization factor enters the analysis in another way. As mentioned above, 
we need to renormalize the parameter /i if we wish to compare its values at different lattice 
spacings. More precisely, we need the ratio 



^A) uf"" 



''OP 



coarse 
''OP 



(39) 



Rm is in principle independent of A, although when is evaluated at any given order in 
perturbation theory, there is residual A dependerice from neglected higher order terms. For 
definiteness, we take A = 2 GeV. Zm is given by 



ZrniaA) = (1 + av{q*) Zi^\aA) + 0{a^)) 



(40) 



where ay is determined from small Wilson loops using third order perturbation theory 



2^. 



is 
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;he optimal scale q* is estimated using a second order BLM method 



35, 



and Z 



(2) 



In(aA) 



(41) 



with b ^ 0.5432, correct to 0.1%. We have neglected the (tiny) 0{a) mass dependence of 
b, and hence of Z^^aA). From Ref. [22|, q* = 2.335/a and ay(g*) = 0.252(5) on the coarse 
lattices; q* = 1.80/a and av{q*) = 0.247(4) on the fine. 

To evaluate Rm, we use scale and plaquette values from the coarse 0.01/0.05 and the fine 
0.0062/0.031 lattices, and neglect the the smal 



mentioned previously, the T splittings give 
2.271 GeV. The tadpole improvement factor are Mgp 
Eqs. dSni), (001) and (HH), we find Rm ^ 0.958. Then, 



variation among the coarse or fine sets. As 
121, El (a™''''^^)-^ = 1.588 GeV and (a^"""^-^ 



0.8677 and u^f 



0.8782. From 



Rmf^ 



fine 



(42) 



^ A was called ^ in a notation we avoid here for obvious reasons. 
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with the above value of Rm defines what we mean by "equahty" of the parameter /i on coarse 
and fine lattices. Of course, Eq. ()42|) may be violated by generic O(a^) scaling violations 
(~2%), as well as by perturbative errors. A priori, one expects a two-loop correction to 
Rm of order ay. This is ~ 6%. In practice, fits have a confidence level that is higher 
than those of our preferred fits if we take R^ ~ 0.87 to 0.89, ji.e., a 7 to 9% difference 
from Rm = 0.958. Although it is not possible to separate the perturbative errors from the 
discretization errors in this difference, here and in Ref. 2^ we take the larger value, 9%, as 
the conservative estimate of perturbative errors. This is ~1.5ay. For quantities that do not 
directly involve perturbation theory, such as the decay constants and the ratio m„/mrf, we 
do not quote perturbative errors, per se. But Rm still enters the chiral fits, so we include 
fits with Rm = 0.87 among the alternatives. 

Another rough estimate of Rm comes from /itree, Eq- ©• Without the proliferation of 
parameters at NLO and NNLO, the tree-level form makes possible well-controlled fits on 
coarse and fine lattices separately. We get Rm ~ 0.977. But note that Eq. (jH)) can have up to 
~ 7% errors in describing the data, and there are also discretization errors in this estimate. 

VIII. ELECTROMAGNETIC AND ISOSPIN- VIOLATING EFFECTS 

Given the precision we are aiming at here, it is necessary to take into account electro- 
magnetic (EM) and isospin- violating effects, at least in an approximate way. Our simulation 
is in isospin-symmetric QCD, with m„ set equal to m^, and the electromagnetic coupling, 

= AnaEM, set to 0. This means that when we compare meson masses to experiment 
to determine the physical quark masses m and m^, we must first adjust the experimental 
numbers to what they would be in a world without EM effects or isospin violation. This 
is particularly important for the pion, since the difference between and m^o is almost 
7%. Because the adjustment is only approximate, there are some residual systematic errors 



on the quark masses, as discussed in Ref. 
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The decay constants, as well as the low energy constants Lj, are by definition pure QCD 
quantities, so we do not have to take EM effects directly into account in our determination.^ 
Nevertheless, there are indirect EM effects on and J'k, which come in through the quark 



^ However, the EM corrections must be explicitly evaluated when the decay constants are compared to 
experiment [s^lsst. 
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masses when we extrapolate to the physical point. Isospin violations are irrelevant for the 
Lj, which are defined to be mass independent. But for the decay constants, there are both 
direct and indirect isospin- violating effects, which we estimate below. The end result is that 
both the (indirect) EM and isospin-violating errors on decay constants are very small, as 
long as we are careful to extrapolate to the appropriate values of the quark mass in each 
case. However the EM error on niu/md is large unless we are willing to assume that the EM 
effects on meson masses are accurately known. 

Electromagnetism can be included in XPT in a systematic way. Dashen's theorem 
summarizes the EM effects on meson masses at lowest nontrivial order in and the quark 
masses. It states that and receive equal 0{e^) contributions in the chiral limit; 
while the 7r° and masses are unaffected. However, there can be large and different EM 
contributions to and of order e^Xud,: 

Ae parameterize violations of Dashen's theorem: 



0,13, 42, Q 



Following Ref. |4J], we let 



(m^+ - m^o)EM = (1 + ^E){ml+ - m^o)EM ■ (43) 

Then Refs. suggest A^; 1 Most of these corrections are probably to the 

charged meson masses. Indeed, the violation of Dashen's theorem for the 7r° is 0{e^Xud) 
4l| and therefore small. The EM contribution to m^o, on the other hand, is in princip 



the same order as the violations of Dashen's theorem for the charged masses, e^Xud,s 41 1. 
Nevertheless, a large iVc, extended NJL model calculation finds a tiny EM correction to 
the mass at this order. To be conservative, though, we allow for EM contributions to 
m^o of order of half the violations of Dashen's theorem, with unknown sign: 

("^^o)em ~ ±(Ai5/2)(m^+ - m^o)EM ■ (44) 

The effects of isospin violation in the pion masses are quite small. When ^ rrid, 
gets a contribution of order {xu —JCd)'^- The isospin-violating splitting (m^+ — m^o)QCD is 
estimated as 0.17(3) MeV in Ref. |l|, and as 0.32(20) MeV in Ref. Q- In the kaon system, 
on the other hand, the effects of isospin violation are clearly important, as is obvious from 
the fact that the experimental K^-K^ splitting is of opposite sign to that in Eq. (j43j) for 
any A^; > — 1. In our calculation, we can reduce the isospin violating effects in the kaon 
masses to the same order as in the pion system by focusing on the isospin averaged quantity 
(^/^o + '^/^+)/2. We then neglect the remaining isospin violations in the meson masses. We 
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have checked, using the estimates for {m^+ —m^o)QCT) above, that the indirect effect of such 
isospin violations on decay constants is extremely small: <0.03%. These isospin violations 
were also neglected in the computation of quark masses in Ref. 2^. We note, however, that 
including isospin violations could have some small effect there, in particular on the result 
for the ratio nis/rh. If {171^+ — 'm7ro)QCD is at the upper end of the range in |45], the central 
value for m^jrh in Ref. |22] could be changed from 27.4 to as low as 27.2. 

Based on the above discussion, we may determine the physical values of ra and by 
extrapolating the lattice squared meson masses to m| and m^, given by 



2 2 



i (m^o + m^+ - (1 + As)(m2+ -m^o)) , (45) 



where experimental values are to be used on the right hand side. Allowing for EM corrections 
to the mass, Eq. (j44j) . replaces A^; ^ 1 in Eq. (j45j) with an effective value in the range 
0-2, which is in any case a conservative range for A^; that includes the Dashen theorem 



result. Here and in Ref. [2^, we take A^ = 1 for the central value, and use < A^; < 2 to 
estimate systematic errors in m, mg, and their ratio. 

We can also estimate (or equivalently the ratios mu/fnd or mu/rh) from our simulation. 
Given m^, we find by extrapolating in the light valence mass to the point where the 
has the mass {rnx+)QCD, where "QCD" indicates that EM effects have been removed. We 
take 

("^/f+)QCD = m\+ - (1 + 5E){rnl+ - m^o) , (46) 

with 5e = ^ our central value, corresponding to A^; = 1 and vanishing EM correction to the 
K'^ mass. If we attribute the uncertainty in the effective value of A^; = 1 to the uncertainty 
in Eq. then we get a range 0.5 < 5s < 1.5. This produces only a small uncertainty 

in rriu/md because the variations in (m^+)QCD and m|. are equal. A more conservative 
assumption is that A^; arises primarily from EM contributions to the mass. This implies 
6e = and thus < 5^; < 2, which we take as the range for estimating EM systematic 
errors in 171^/171^. Under this assumption those errors are quite large, ~ 20%. On the other 



421, this error would be 



hand, if we were for example to take A^; = 0.84 ± 0.25 from Ref. 
reduced to ~5%. 

There is an additional error on m^/m^ because we keep the light sea quarks with fixed 
masses m„ = rud = m as we extrapolate in the light valence mass to (?77,|,+ )qcd- The effect 
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produces a fractional error in mu/md of 0{{mu — rrid)'^), i.e., of NNLO. This is because 
terms of 0{mu — md) cancel when expanding m„ and rrid around m = (m„ + md)/2. We 
estimate the size of this effect using the NNLO analytic terms; from Eq. (j32j) the only 
relevant coefficient is /^g™""*. This term gives a fractional error (3^"^\xu + Xd ^ '^xid)- Using 
Xud ~ ""^^/(Stt^/^) ~ 0.014, our result rriu/md ~ 0.43, and /Jg'"'' ~ 1.86 in the continuum 
limit from Fit C (see Sec. IIX CI and Table HVj) . we find a negligible error :^0.01%. 

Our simulation directly determines decay constants of the charged mesons, vr"^ and K~^, 
in the absence of electromagnetism and with m„ = m^. Since is (approximately) the vr"*" 
mass in this limit, we must simply extrapolate /7r+ to the point where the mesons have the 
masses in Eq. (j45j) . i.e., to our physical values of m and m^. The situation with the kaons is 
rather different. It is that is measured experimentally, not some isospin averaged decay 
constant of and K^. We therefore should extract by extrapolating the light valence 
quark to the physical value of m^, not m. Despite the large uncertainty in m„/m^ from EM 
effects, the indirect error induced in through m„ is tiny, ~0.07%. This is due to the fact 
that the decay constant changes slowly with mass: It varies only by 20% all the way from 
the K to the vr. 

In principle there are also direct isospin- violating errors in the decay constants. For fK+ , 
there is an effect we can estimate from the coefficient /3k, , Eq. ()33p. similar to the one in 
rriu/md from Since /3g'^^ in our fits is ~— 0.1, i.e., much smaller than this effect 

is completely negligible. For /^+, errors can also arise from the coefficient (31 because we 
assume = rrid. But this coefficient is ~— 0.06, again leading to a neghgible effect. 

IX. SXPT FITS 

We fit the partially quenched data for mi^/^rrix + m„) and fp+ together in all fits; this 
helps to constrain the common O(a^) chiral parameters. Similarly, both coarse and fine data 
are fit together, helping to constrain the overall lattice spacing dependence. Correlations 
between and among masses and decay constants within each sea-quark set are included, 
with the covariance matrix computed as described in Sec. IIIII 



31 



A. Data subsets and fit ranges 



Our lattice data is very precise (0.1% to 0.7% on m^+/(ma; + my), and 0.1% to 0.4% on 
/p+); while the XPT expansion parameter for the kaon, Xud,s^ is ~0.18. Since xlids ~ 0.03, 
we cannot expect NLO XPT, which is missing corrections to rn^p+l (f^x + fny) or fp+ of order 
Xq, to work well for meson masses that are even an appreciable fraction of rriK- NNLO 
XPT, however, may allow us to fit up to fairly near itlk, because the missing corrections 
at the kaon, Xuds ~ 0.006), are comparable to (but somewhat larger than) the statistical 
accuracy of our data. Of course, this is only a rough guide. There are at least two sources of 
complications: (1) It is an idealization to imagine that the chiral expansion is governed by a 
single mass parameter. Many different mesons contribute to chiral loops. Although we can 
restrict the valence masses in the fit, the s sea quark mass in the simulations is fixed at m'^. 
Thus there will always be some contributions from fairly heavy mesons. (2) Taste violations 
produce additional contributions to meson masses, or, effectively, add another expansion 
parameter, Xa2, Eq. (fT2|l . 

In practice we consider three different subsets of our complete (coarse and fine) partially 
quenched data set. Compared to the strange sea quark mass in the simulations, m'^, we 
can tolerate somewhat heavier valence masses on the fine lattices, since on those lattices 
exceeds rris by a smaller amount and contributions to meson masses from taste splittings 
are smaller. The sets are: 

• Subset /. 

rrix + my < 0.40m'^ (coarse), and m^ + my < O.bAm'^ (fine). 
47 valence mass combinations; 94 data points. 

• Subset //. 

+ my < 0.70m'^ (coarse), and m^ + my < 0.80m^ (fine). 
120 valence mass combinations; 240 data points. 

• Subset ///. 

mx + my < l.lOm'^ (coarse), and m^ + my < l.lAm'g (fine). 
208 valence mass combinations; 416 data points. 

There are always twice as many data points as mass combinations because we are fitting 
m^+/(ma, + my) and fp+ simultaneously. 
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On the fine lattices, m(, is about 10% larger than m^, so we expect errors of order (0.54 x 
1.1 X 0.18)^ ~ 1.1% at NLO in subset /, without even considering the effects of mixed valence- 
sea mesons or taste violations. Thus we do not anticipate that the NLO form, Eqs. (jl7|) and 
(jl8|) . can fit the data, even on subset /. Good fits should require at least the NNLO forms, 
Eqs. (jH^ and on all sets. Indeed, fitting with Eqs. (fTTj) and (fTH|) on subset / gives 

minuscule confidence levels (CL < 10~^^; x^/d.o.f. = 6.39 with 74 degrees of freedom), and 
adding in those NNLO terms that involve sea quark masses (because m'^ is not small) still 
results in CL < 10^^^ (x^/d.o.f. = 3.00 with 62 degrees of freedom). 

We note that it is not practical to use valence masses below those in subset / because 
we rapidly run out of data, and in any case we cease to reduce significantly the masses of 
mesons made of a valence quark and a strange sea quark. 

B. Inventory of parameters and alternative fits 

Since there are a large number of fit parameters, we provide an inventory before discussing 
the final fits in more detail. Our standard NNLO fits on subsets / and // have the following 
number and types of parameters: 

(a) LO. 2 unconstrained parameters: /i [Eq. (fT7j) ] and / [Eq. |T8|l ]. 

(b) NLO (physical). 4 unconstrained parameters: 2L^—L^, 2^5—^4, L5, L4 [Eqs. (fTTjl . 
(UHl) and (EZD]. 

(c) NLO (taste- violating). 4 unconstrained parameters: a^^y, o^^'a) o?L\ a^L" 
[Eqs. (HID and W\- 

(d) NNLO (physical). 10 parameters: . . . , p'i''^ [Eq. (jSSl)], and I3[^\ . . . , pij'^ 
[Eq. ()33j) ]. For preferred fits, these are constrained with Bayesian priors to have stan- 
dard deviation of 1 around 0. But alternative fits used for systematic errors estimates 
leave these parameters unconstrained. The difference in CL or final results is small. 

(e) Scale. 4 tightly constrained parameters that determine relative scale of different 
lattices: Cqo Cio, Cqi, C20 [Eq- ©]• These are allowed to vary by 1 standard deviation 
around values in Eq. (jSJ. The (small) variation in these parameters makes very little 
difference in CL or central values, but including the variation in the fit allows us to 
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incorporate the statistical errors in relative scale determination into the statistical 
errors of our results. 

(f) Lattice spacing dependence. 16 parameters (usually tightly constrained) that 
control the fractional difference in the physical fit parameters [(a), (b), and (d) above] 
between coarse and fine lattices. In our preferred fits we allow the LO parameters [(a)] 
to vary by 2% (at the 1 standard deviation level), and consider alternatives of "0%" 
{i.e., no variation in LO parameters between coarse and fine lattices parameters), 1%, 
and 4% in estimating systematics. For the NLO parameters [(b)], the central choice is 
2.5%; alternatives are 0%, 1%, 4%, and 6%. For NNLO parameters [(d)], the central 
choice is 2.5%; alternatives are 0%, 1%, 4%. We also consider complete removal of the 
constraints for various small subsets of the NLO and NNLO parameters. 

In our standard NNLO fit there are thus a total of 40 parameters, of which 20 are generally 
tightly constrained. 

We remind the reader that the NLO taste-violating parameters [(c) above] are forced 
to change by a fixed ratio (in a given fit) in going from the coarse to fine lattices. The 
point is that these parameters start at (9(a;|a^), so we know how they change with a, up 
to corrections that are higher order in as and/or a. A range for the ratio is considered in 
assessing the systematic error (see discussion following Eq. (fTUjl ). 

The priors restricting the parameters governing lattice spacing dependence [(f) above] 
require further explanation. We note first that it is not possible, at least with the current 
data set, to remove these restrictions on all the physical parameters, allowing them to be 
arbitrarily different on coarse and fine lattices. If we do that, the fit becomes unstable 
because there are directions in parameter space in which the fit function is almost flat. 
Some of these directions can easily be seen in Eqs. (fTTjl and (fTH|) . For example, a^L' + a^L" 
can grow large, compensated by a decrease in n and corresponding increases in 2L^ — L^, 
2LQ—L4, a^5^, a'^S'y, and the jS^"^^ in Eq. (jSH). Only the first (continuum) log term in Eq. (fTTj) 
is uncompensated, but we already know that the good fits allow a fairly large range in its 
coefficient, the chiral coupling. (See discussion of the parameter u in Sec. IVIBI ) A similar 
mode involves a'^L" — a^L' growing and / decreasing in Eq. (|T8|l . Even when such modes 
eventually converge, the resulting fit is completely unphysical, with 20%-100% variation 
of results with lattice spacing and enormous NLO chiral corrections. Once the physical 
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parameters are required to change by only a small amount with a, however, they cannot 
compensate for changes in taste-violating parameters like o^L' and a^L", and the runaway 
modes are damped. 

Since the taste-violating NLO parameters {a^V and a^L") are included explicitly, the LO 
parameters / and /x should have only generic (~2%) changes with a, which is our preferred 
choice for priors for their variations. Of course, as mentioned following Eq. ()42|). differences 
in /i between coarse and fine lattices can also be due to perturbative errors in the ratio 
Rm- When we use the one-loop value for R^, 0.958, the fits prefer an ^ 8% difference in 
the renormalized /z value between coarse and fine, which is one indication of the size of the 
perturbative error. 

For most of the NLO physical parameters [(b)], the preferred 2.5% prior for lattice- 
spacing dependence is not restrictive, with the fits finding a change that is significantly 
smaller (<0.5%). The exception is 2^8— for which the 2.5% prior results in almost a 6% 
difference (a 2. Sex effect), suggesting that the corresponding NNLO taste- violating term has 
a sizable coefficient. If we instead remove any restriction on the a-dependence of 2L^ — L^ 
(while keeping the constraints on the other physical parameters), 2L^ — L^ varies by ~20% 
from coarse to fine, which is sizable, ^^{x'^^2^^^° — X^°)- Perhaps generic and taste-violating 
effects are both contributing significantly in the same direction. Fortunately, the continuum 
extrapolated value of 2Lg, — L^ changes by only 5% when we remove the restriction on its 
a-dependence. In any case, the systematic errors on 2Ls — L5 (as well as on the other Li) 
are dominated by the larger changes caused by varying the mass range and/or the details 
of the chiral fits. Thus, even if we were to take the full 20% variation from coarse to fine as 
the "discretization error" on this parameter the final errors quoted in Sec. IXl would hardly 
change at all. 

Our preferred 2.5% prior for the NNLO physical parameters [(d)] is again generally not 
restrictive, with f]!^^^ the only exception. We note that allowing the NNLO terms to vary 
with a is not systematic,*^ because it effectively introduces some, but not all, NNNLO terms. 
Therefore we consider an alternative in which NNLO physical parameters do not vary with 
a ("0% priors"), but all other features of the preferred fit are unchanged. This fit has lower 
confidence level (CL = 0.36) than the preferred fit (CL = 0.65), but gives physical results 

^ We thank Laurent Lellouch for pointing this out. 
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that are very similar. 

The alternative choices of priors for parameters (f ) also give good fits, except in the cases 
of 1% (CL ^ 5 X 10^^) and "0%" priors^ (CL ^ 10^^). However we keep all the choices in 
the systematic error analysis. The 0% priors case, where all physical parameters are fixed as 
a function of a, gives results that are in no way extreme among all alternative fits considered 
in this work. 

The preferred NNLO fit with parameters (a)-(f) above results in good confidence levels 
on data subsets / and //. This is all we need to extract the low energy constants Lj, and it is 
acceptable for determining f.^ and rh. But to determine fx and without an extrapolation 
to heavier valence masses, we need to fit to the data in subset ///. We can then interpolate to 
the physical strange mass. However the preferred NNLO fit, or variants thereof, does not give 
good confidence level when applied to subset ///. Based on the discussion in Sec. lIXAl this is 
not surprising, especially considering that the simulation mass is larger than the physical 
strange quark mass. The problem, though, is that it is not possible to move beyond NNLO 
in anything approaching a systematic way without introducing an unwieldy number of new 
parameters. If instead we fit subset /// to our rather ad hoc NNNLO forms in Sec. I VI C\ we 
can obtain acceptable fits. But the high quark masses involved, as well as the nonsystematic 
treatment of the higher order terms, may introduce significant systematic errors in the low 
energy constants and therefore in the extrapolation of rh to the physical point, which is, 
of course, still needed for finding rris and J'k- Thus such fits are not acceptable for finding 
decay constants and quark masses. 

Our solution to the above dilemma is to use the results for all LO and NLO parameters 
from the fits to subset // as inputs to the fits on subset ///. We then use the form Eqs. (IHUj) 
and (jHTj) on subset ///, but with the LO and NLO parameters, and their change with 
lattice spacing, constrained to vary by at most their statistical errors around their previously 
determined values. The NNLO and NNNLO parameters are left unconstrained. Their 
variation with lattice spacing, which implicitly introduces higher order mixed terms in Xq 
and Xa2, either is also left constrained or is constrained mildly, with 1-a constraints of 10% 
(preferred fit), 15%, or 20%. The total number of parameters in this fit is 48, of which 20 
(16 LO and NLO parameters and their a-dependence, plus the usual 4 scale parameters) are 

^ In this case there are 24 free parameters in the fit. 
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tightly constrained. We get reasonable fits with all these versions. The advantage of this 
approach is that we can interpolate to the physical strange quark mass and extrapolate to 
the physical light quark mass in the same fit. We therefore use the preferred fit to subset 
/// for our central values for quark masses and decay constants. The ad hoc nature of 
the higher order terms in this fit is not a problem because we have already satisfied the 
chiral constraints at low quark mass. All that is required in the region of is a fit that 
interpolates from our partially quenched valence masses and nominal sea quark mass to the 
physical value. In other words, we do not need to rely in detail on chiral perturbation 
theory for the dependence, since we can reach the physical value of iris in the simulation. 
Effectively, we are depending only on two-flavor XPT. Note however that we include the 
more conventional chiral fits on subsets / and // among the alternatives when estimating 
the systematic errors. 

C. Fit results 

Figures ITT] and IT^ show our preferred NNLO fit to data subset //. We call this fit "Fit B"; 
the corresponding fit on data subset / is called "Fit A." Fit B is a single fit to the data in 
both Figs. E] and ^1 as well as many more data points not shown. The fit has a chi-square 
of 192 with 200 degrees of freedom, giving CL = 0.65. This is a standard CL, with 
summed over all data points, and number of degrees of freedom (d.o.f.) given by number of 
data points minus the number of parameters. If we include the Bayesian priors as effective 
"data points," then Fit B has a chi-square of 235 with 230 degrees of freedom, CL=0.39. 
The fact that this is also an acceptable CL indicates that SXPT and our assumptions about 
the a-dependence of fit parameters are reasonably well behaved in this mass range. Fit A 
gives very similar results for decay constants and quark masses, but includes many fewer 
points (94 vs. 240) and has a lower confidence level (0.23). As discussed in Sec. IIX D A\ 
however. Fits A and B produce rather different values for the low energy constant 2Lq — L4, 
indicating a large systematic uncertainty in that parameter. For our central values of the 
Lj, we average the results of Fit A and B, and include the difference in the error. 

FiguresElEl and^lshow tt and K decay constants and tt masses from the corresponding 
preferred NNNLO fit to data subset /// ("Fit C"). The LO and NLO parameters here are 
fixed, up to their statistical errors, by Fit B. Fit C has a chi-square of 383 for 368 degrees of 
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freedom (CL=0.28); including the priors gives a chi-square of 418 for 402 degrees of freedom 
(CL=0.28). Central values of fx, f-K, fx/ fir, ^s, rh, ms/rh, and rriu/md are taken from this 
fit; while Fits A and B are included as alternatives in estimating systematics. 

Using the volume-dependence from NLO SXPT, Eqs. (jl3|) and (jl4j) . the leading finite 
volume effects can be removed from our data. Such effects are rather small to begin with 
(< 0.9% on Mp+ and < 1.4% on fp+, based on fit B), and this calculated volume- dependence 
is consistent with simulation results in the one case where two different volumes are available 
[igj ]. One-loop finite volume effects have been removed from the points and lines shown in 
Figs. HI] - Hni Possible residual errors from higher order finite volume effects are discussed 
in Sec. I1XD6I 

To extract continuum results for masses or decay constants from Fits A, B, or C, we first 
set the taste splitting and the taste violating parameters to zero. We then extrapolate the 
remaining, physical parameters to the continuum linearly in asa"^- For central values, we 
assume that the ratio of this quantity between fine and coarse lattices is 0.427 (see Eq. (^Bj) ). 
For the LO parameters we take the range of the ratio to be 0.398 to 0.441 in estimating the 
systematic error, as in the discussion of Eq. (jSE))- But for all other parameters, we expand 
the range to 0.30 to 0.441 in recognition of the fact that the fits do not distinguish generic 
discretization errors, (9(a5a^), from taste- violating errors, (9(Q;|a^). We thus must include 
the range discussed following Eq. (jTUl). 

Table HVI shows the central values of the continuum extrapolated parameters for Fits A, B, 
and C. Note that the statistical errors on most of the parameters are quite large. This seems 
to be a consequence of the "flat directions" in the fitting function, as described in Sec. IIXBI 
small fluctuations in the data can produce large variations in the parameters. However, 
because of the correlations among the parameters, the statistical errors of interpolated or 
extrapolated decay constants and masses are small, comparable to those of the raw data. 

Once the continuum chiral parameters are obtained, we set valence and sea quark masses 
equal and obtain "full QCD" formulas for m^/(2m'), m|^/(m' + m(,), and fx in terms 
of arbitrary quark masses m' and m'^. The cyan lines in Figs. ITD - ITHl show these as a 
function of m', with held fixed at the value of the simulation sea quark mass on the fine 
lattices. As a consistency check, we also show in each case the result from extrapolation of 
full QCD points to the continuum at fixed quark mass (cyan fancy squares). To generate 
these points, we use the chiral fits only to interpolate the coarse data so that it corresponds 
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Fit A 


Fit B 


Fit C 




5.579(515) 


4.549(387) 


4.462(227) 


ri f 


0.186(14) 


0.185(21) 


0.185(15) 


{2Lq - Li) X 10^ 


0.244(156) 


0.705(157) 


0.763(89) 


(2L8 - L5) X 103 


-0.038(96) 


-0.330(113) 


-0.392(76) 


L4 X 10^ 


0.178(231) 


0.200(340) 


0.186(239) 


L5 X 10^ 


1.834(247) 


1.949(263) 


2.054(179) 




-0.566(80) 


-0.279(43) 


-0.260(79) 




-0.314(195) 


-0.994(193) 


-1.050(124) 




0.208(36) 


0.149(33) 


0.145(19) 




-0.281(22) 


-0.150(13) 


-0.081(9) 




0.554(336) 


1.658(367) 


1.861(310) 




0.237(62) 


0.188(36) 


0.257(85) 




0.135(142) 


0.131(209) 


0.128(141) 




0.189(36) 


0.182(33) 


0.150(54) 


AP 


-0.059(36) 


-0.058(24) 


-0.062(15) 


' 


-0.098(286) 


-0.115(415) 


-0.109(389) 








-0.130(44) 








0.049(52) 








-0.063(165) 


pU) 






-0.132(69) 



TABLE IV: Continuum extrapolated fit parameters for Fits A, B, and C, which are on mass subsets 
/, //, and ///, respectively. No extrapolation errors are included; we show statistical errors only. 
See Sec. IVII for definitions of the parameters. 

to the same physical quark masses as the fine data. There are just two such points in 
each plot because we have just two runs with different sea quark masses on the fine lattice. 
Since discretization errors come both from taste violations and generic errors, there is an 
ambiguity in the extrapolation used to find these points. We have assumed that taste 
violations dominate and have extrapolated linearly in a^a^, i.e., with a ratio of 0.35 in a^a^ 
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between coarse and fine (see discussion following Eq. ()10|) ). This agrees both with our order 
of magnitude estimates (taste violations ~ 6%; generic errors ~ 2%) and with a detailed 
analysis below (Sec. IIXD 

To proceed further we need to know the physical values of the quark masses. These can 
be obtained from Fig. or Fig. by finding those values of rh and that give the tt and 
K their physical QCD masses in the isospin limit, and (defined in Eq. ()45|) ). An 
iterative procedure is required because both meson masses depend on both quark masses, 
although the dependence of on rus is mild, since s only appears as a sea quark. The 
nature of this extrapolation/interpolation is most clearly seen in Fig.[Tni^° which is again Fit 
C, but now shown for squared meson masses as a function of light quark mass. For clarity, 
we plot data with only one choice of sea quark masses for the coarse and fine sets; the 
variation with light sea quark mass is quite small on this scale. The red dashed lines show 
the fit after extrapolation to the continuum, going to full QCD, and iteratively adjusting 
the strange quark mass to its physical value, so that the pion and kaon reach their physical 
QCD values at the same value of m. 

Note that nonlinearities in the data are quite small on the scale of Fig. ^1 Linear fits to 
as a function of + niy would change the physical quark mass values by only 2% to 
7%, depending on the range of quark masses included and whether or not we fit the decay 
constants simultaneously. (The correlation between masses and decay constants implies that 
the fits are correlated even in this case, where they have no free parameters in common.) 
However, the tiny statistical errors in our data imply that even small nonlinearities must be 
accurately represented in order to obtain good fits. Indeed, linear fits have x^/d.o.f. ~ 20. 
For an example of the accuracy of a linear fit, see the lowest fit line (for Goldstone pions) 
in Fig. El 

Once the physical s quark mass is in hand, we can adjust the cyan lines in Figs. ITT] 
ITSlto put the strange mass at its physical value. This gives the dotted red lines. Following 
the dotted red lines to the physical light quark mass m gives our extrapolated results for /^r, 
plotted as red fancy pluses in Figs. [TT] and UHl The errors in the red fancy pluses are statistical 
only; the systematic errors are shown separately. We also plot the "experimental" values 
of the decay constants 37j, where we put experimental in quotation marks to emphasize 

An almost identical plot, but without the extrapolation to find mu/md appeared in Ref. [2^ . 
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that the decay constants are extracted from experiment using theoretical input and values 
of CKM matrix elements, which themselves have uncertainties.^^ 

Figure ITHl also shows how is extracted: With rris and m determined, we can continue 
the full QCD kaon line as a function of the light valence quark mass (holding the light sea 
quark mass fixed at m) until it reaches the value of (?^|;+)qcd5 Eq- (!46j) . The continuation is 
shown as a green dashed line. For clarity, a magnified version of the relevant region is shown 
in Fig. El There is a slight change in the slope of the dashed line at m|. because, below 
this mass (green section), the light sea quark mass is no longer changing. Above this point, 
light valence and sea masses change together. The ratio of the x coordinates of the points 
where the kaon line intersects the physical (m^+)QCD and values is m„/m = 0.60. 

Given niu, we can extract fx, which is really fK+, not the decay constant "/^" of an 
isospin-averaged kaon. After extrapolating the chiral parameters to the continuum, we set 
valence and sea strange quark masses equal to the physical m^. We now make a two-step 
extrapolation in the light quark mass, as shown in Fig. We first set the light sea mass 
equal to the light valence mass m^. and extrapolate in down to m (red dotted line). 
We then fix the sea quark mass at m and continue the extrapolation in valence mass rrix 
to rriu (short green dotted line). It is clear from the size of the systematic errors that this 
final short extrapolation does not at present produce a significant change in fx- However, 
this distinction between fx+ and will become more important as lattice computations 
improve. 

D. Discussion 

Good fits are not possible without the taste- violating terms in SXPT. Figure ^1 corre- 
sponds exactly to Fig. ^2 except that in Fig. the taste splittings in meson masses have 
been set to zero and the taste-violating chiral parameters {6y, 5^, L', and L") have been 
eliminated. Thus the fit in Fig. ^Jis to the "continuum" NNLO XPT form,^^ which has only 
four fewer parameters than the SXPT fit in Fig. ITTlfz.e.. a total of 36). We put "continuum" 
in quotation marks here because generic variations in physical parameters between coarse 
and fine lattices are still allowed. Further, even if we allow these generic variations to be 

See SecElfor additional discussion about the experimental value of /k- 
■^'^ As in the SXPT case, NNLO chiral logarithms are not included. 
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arbitrarily large, instead of the ~ 2% variation permitted in the standard fits, we cannot 
obtain good fits without including the taste violations. The fact that continuum XPT fits 
are so poor reassures us that the good fit in Fig. ^2 is not simply a trivial consequence of 
having a lot of fit parameters — one has to get the physics right. Other test fits described 
below, such as fitting without the nonanalytic terms in the fit function (Sec. I1X-D2|) . give 
additional reassurance, since they have equal or comparable numbers of parameters to Fit 
B but cannot describe the lattice data. 

1. Continuum extrapolation 

For the decay constants, our preferred method of continuum extrapolation is to extrap- 
olate the chiral fit parameters, as described in Sec. IIX CI An alternative method is to 
determine decay constants in physical units at fixed lattice spacing, and then attempt to 
extrapolate these quantities. There are two ways to find fixed lattice-spacing values: (1) 
simply use the complete chiral fits to extract the decay constants on the coarse or fine sets, 
or (2) first set the taste- violating parameters {5'y, 5^, L', L") and splittings in the fit to zero, 
and then extract the decay constants for each set. The advantage of method (2) is that, once 
taste- violations have been set to zero, remaining discretization errors should be dominantly 
of the generic type, so we may extrapolate to the continuum linearly in aso? ■ In method (1), 
the decay constants at fixed a have both generic and taste- violating discretization errors, so 
there is an ambiguity in extrapolating them to the continuum. 

Figure ^1 shows the decay constants at fixed a using both methods, and the various 
extrapolations to the continuum. Once taste violations are removed (method (2)), the 
remaining discretization errors are quite small, giving a l%-2% change between coarse and 
fine, as expected. Further, because this change is small, it would not make a significant 
difference in the extrapolated answer if we were to replace the aso? extrapolation with an 
a^gO^ one. With method (1), the change from coarse to fine is 5%-6%, roughly the same size 
as the difference in the raw data between these lattices: See Figs. El^El^'^ There is also 
a noticeable difference (~1.2%) in the extrapolated results with method (1), depending on 
whether 0{asa^) or 0{a\a^) errors are assumed. This ambiguity would grow to ^2.2% if 

The difference between coarse and fine raw data appears to be significantly greater than 6% in Fig. I14L 
but this is because the raw points also have different strange valence quark masses. 
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we used the full allowed range of values for the 0{asa?) or C(a;|a^). For f^, and fx, we 
therefore do not consider method (1) results among the possible alternatives in assessing 
the systematic error. We use the parameter-extrapolated version (red diamond and fancy 
plus in Fig. fTS|) for central values, and include the results of method (2) when estimating 
systematic errors. Note that we are rejecting method (1) because of its inherent ambiguity, 
not because it disagrees with the other methods of extrapolation. 

On the other hand, for the ratio fx/fn and for quark masses {rriu/md here, and m^, m 
and ms/rh in Ref. 2^), the results change little with lattice spacing, so the ambiguity in 
method (1) is tiny (much less than other systematic errors). Therefore, in those cases we 
include all three methods of extrapolation in our systematic error estimates. 

The nice consistency of Fig. with our understanding of the sources of discretization 
errors is comforting. However, we caution the reader that some aspects of this picture 
are dependent on the assumptions that went into our fits. In particular, we have inserted 
Bayesian priors to enforce a (1 standard deviation) constraint that LO chiral parameters 
change by at most 2% from coarse to fine lattices. When we relax this constraint to 4%, 
the difference of method (2) results from coarse to fine increases to 3%-4%; while method 
(1) differences remain at about 6%. The relaxed version of the chiral fit is included in the 
systematic error analysis. With just two lattice spacings, however, we cannot remove this 
constraint entirely without the fit becoming unstable, as has been emphasized in Sec. IIX Bl 
Instead, the key point here is that we can obtain good fits of our entire data set based on the 
theory of taste violations, plus some smaller generic errors. This is to be contrasted with our 
failed attempts to fit the data without including taste-violations, even when generic errors 
are allowed to be arbitrarily large. 



2. Evidence for chiral logarithms 

From Sec. IVI Bl we know that the coefficient of the chiral logarithm terms, the chiral 
coupling l/(167r^/^), is not tightly constrained by the fits. If the chiral coupling is allowed 
to be a free parameter, fits to higher valence masses prefer values of / near fx in the 
coupling; while fits to lower masses prefer / closer to On the other hand, acceptable fits 
can be obtained for all our mass ranges with the chiral coupling fixed anywhere between its 
value for /^r and that for fx- Given this freedom, can we claim that chiral logarithms are 
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observed at all? To answer this question, we consider a variety of alternative fits without 
chiral logarithms. 

First of all, since Fig. appears so linear to the eye, one can ask whether a simple linear 
fit would work. The answer is no: linear fits of m^^ vs. + rriy have chi-square per degree 
of freedom ~ 20. The point is that the statistical errors in the data are so small, and the 
correlations are well enough determined, that the small departures from linearity must be 
accurately represented by the fits. These deviations are seen more clearly in Figs. ^1 and 
[T31 where the valence quark masses are divided out. Similarly, even though the apparent 
curvature in the decay constant data is not large (see Figs. [TT] and UHl), linear fits of fp+ vs. 
rrix + my are also terrible, with chi-square per degree of freedom ~25. 

We next check whether the data can be fit by including all the higher order (non-linear) 
analytic terms, but omitting the chiral logarithms. With the chiral logarithm functions ^, 
Eq. (fT^ . and Eq. (fT^. set to zero, we attempt a fit directly comparable to our NNLO Fit 
B on mass subset //. This fit has 38 free parameters, which is 2 less than Fit B because the 
taste- violating hairpin parameters 5^ and 5'y decouple when £ = £ = 0. Despite the large 
number of parameters, the fit is very bad, with x^/d.o.f. = 7.38 for 202 degrees of freedom; 
CL < 10-^9^ 

Becirevic and Villadoro have pointed out that, for some current simulations on small 
volumes, the finite size effects are much more important than the actual chiral logarithms. 
We would not expect that to be the case here since the lattice volumes are relatively large 
(L > 2.5 fm) and finite volume effects here are small (at most 1.4% — see Sec. IIX Vj\ . To 
check this expectation, we first removed the finite volume effects from the data using Fit B, 
and then fit again to the 38 parameter form with £ = £ = 0. This fit is improved over the 
previous one, but still quite bad: x^/d-o.f. = 3.08 for 202 degrees of freedom; CL < 10^"^^. 
Our conclusion is that the effect of the staggered chiral logarithms is in fact observed in our 
data. 

One can ask whether the finite volume effects are also directly observed. The answer 
seems to be yes: A 40-parameter fit leaving out these effects (setting 5i and ^3 to zero in 
Eqs. (fT^ and (^U) but otherwise identical to Fit B has x^/d.o.f. = 1.95 for 200 degrees of 
freedom; CL ^ 2 x 10^^^ 
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3. Are the lattice masses light enough for SXPT to be applicable? 



To discuss this question, we first have to say what we mean by the physical quark masses 
at fixed a. For current purposes we define the physical values of the lattice masses, arris and 
am, by method (2), i.e., as the quark masses that give the pion and kaon their physical masses 
when all taste splittings and taste-violating chiral parameters are set to zero. This gives 
arris ^ 0.0457, arh ^ 0.00166 on the coarse lattices and arrig ^ 0.0289, arfi ^ 0.00105 on the 
fine. We have not made a detailed study of the errors in these numbers, but systematic errors 
are ~6%, and statistical errors are 1% or less. We could alternatively define the physical 
at a given lattice spacing by method (1), i.e., as the masses that give the physical mesons 
masses directly, including all effects of taste violations in the chiral loops. The latter values 
of quark masses were quoted in 
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221 1 and are about ~ 15% smaller on the coarse lattices 
and ~ 6% smaller on the fine lattices than the ones quoted above. We choose the method 
(2) definition here because we are going to be adding on the taste-splittings to meson masses 
explicitly. We note that the difference between the methods becomes becomes small, 1%- 
2%, once we extrapolate to the continuum, and is included in the systematic errors quoted 



in Ref. 



221. 



We now consider meson masses on the coarse lattice for subset //. This is the worst case 
because subset // contains the highest valence quark masses for which we have applied a fully 
chiral description, and the coarse lattices have the largest additional contributions to meson 
masses from taste violations. The valence masses here obey rrix + rriy < O.Tm'j, 0.77ms. 
Since the smallest valence mass is 0.1 m^, the largest is 0.6 m'^ ^ 0.66 m^; while the largest 
sea quark mass in the simulation is m'^ ^ 1.09 ms. For the following estimates, we assume 
linear dependence of squared meson masses on quark masses, and take 486 MeV as the 
mass a "kaon" would have in the absence of electromagnetism and if the light quark were 
massless. In other words, we use fims = (486 MeV)^. This comes from Eq. and the ratio 
ms/rfi = 27.4 [2^. Then the largest valence- valence Goldstone meson mass in coarse subset 
// is 425 MeV. Adding on the largest splitting (the taste-singlet case) gives 623 MeV; while 
an "average" taste splitting (see discussion after Eq. (|12p) gives 551 MeV. We do not think 
it unreasonable to expect SXPT to work in this mass range, although it is not surprising — 
considering the small statistical errors of the data — that NNLO terms are needed. Further, 
the comparison with subset / results is a good check, because the corresponding masses there 
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are significantly lower: 321 MeV, 557 MeV, and 475 MeV, respectively. 

Mesons with one or two sea quarks also appear in chiral loops. These are generally 
comparable to the masses just discussed. But they are significantly larger when the sea 
quark is an s, which is exacerbated by the fact that the simulation value, m'^, is larger 
than the physical mass. On subset //, this largest valence-sea (Goldstone) mass is 642 MeV. 
Adding on the biggest splitting gives 787 MeV. This taste-singlet meson enters with a factor 
1/16 in the sum over tastes; the "average" taste version is 731 MeV. On subset /, the 
masses become 578 MeV, 736 MeV, and 676 MeV, respectively. Further, there are sea-sea 
contributions, which are independent of the valence mass subset. The most relevant here 
is the taste-singlet rj. Its mass depends (mildly) on the light quark sea masses, but is 
~ 765 MeV including splitting. In addition there are T^'-like particles in the taste axial and 
vector channels whose masses are comparable to the taste-singlet rj but have smaller ((9(a^)) 
couplings. 

The meson masses involving the s sea quark are admittedly quite high to expect that even 
NNLO XPT will be accurate. For example, the largest mass mentioned above, 765 MeV, 
corresponds to a Xq value, Eq. (fTT|) . of 0.43. This suggests an error from neglected terms of 
order (0.43)^ = 8%. But, just as for the valence subset ///, the issue here for decay constants 
and quark masses is only to get a good interpolation to the physical s quark mass. Indeed, 
if the s sea quark in the simulation had been chosen at the a posteriori determined physical 
mass, we would not have needed to use XPT for the s at all, but could use a SUl{2) x SUr{2) 
XPT for the light quark extrapolation. The systematic error on the coarse lattice from 
adjusting m'^ = LOOm^ to may be crudely estimated as (0.43)'^ x [(1.09)'^ — 1] ^ 2%. 
Since m'g is closer to on the fine lattice, some of this error will be extrapolated away 
when we go to the continuum limit. On the other hand, chiral coefficients (at NLO and 
NNLO) that involve the sea quarks are not fit accurately because the "lever arm" is small: 
the sum of the sea quark masses changes by less than a factor of 2 over our entire range of 
coarse lattices and only by 30% for the fine lattices. A more reliable estimate of the error 
in adjusting the s quark mass comes from considering the range in results over the full list 
of alternative mass subsets, chiral fits, and continuum extrapolations. It can be as large as 
half the total chiral error in our results for decay constants and quark masses (see line al in 
Tables IVl and IVHl : the remaining error comes from extrapolating in the light quark mass. 

For the Lj, the situation is somewhat different. Missing higher order terms in the 
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SUl{S) X SUr{3) expansion mean that there is spurious analytic dependence on the hght 
quark masses that increases as meson masses get larger. Here we are missing the NNLO 
chiral logarithms, so those terms determine the size of the errors. Letting M be a generic 
meson mass, the absent terms are of order M"^ log(M^)/(87r^/^)^. Putting in the largest 
meson masses discussed above, results in an estimate of the absolute error in the Lj of a 
few times 10~^. This is indeed the size of the errors we observe when we consider all the 
alternative chiral fits discussed above and/or restrict the valence masses to subset / instead 
of subset // (Sec. |X|). The low energy constant 2Lq — L4 may be an exception: Since the 
errors in it are large on this scale, ~4 x 10~^, they may also not be very reliable. Difficulty 
in extracting 2Lq — is again related to the small lever arm for the sea quark dependence. 
This can be seen in Fig. E[ an enlargement of a small region of Fig. ^| As the sea quark 
mass is changed, the differences are small — comparable to statistical errors — and not 
monotonic. Contrast this with the monotonic sea quark dependence seen for /^^ in Fig. El 
A coarse simulation now in progress, with all three sea quark masses at about 0.66ms, should 
help to reduce significantly the error in the sea quark mass dependence. 

4-. Convergence of XPT 

Figure EDI shows the convergence of SU{?>)l x SU{?>)r XPT for and fx- AH chiral 
parameters in this plot have been extrapolated to the continuum. The NLO terms contribute 
20%. This is true even for because does not vanish in the chiral limit. The convergence 
of SU{2)l X SU{2)ji XPT for can also be extracted from this plot by starting with the 
"chiral limit" line instead of the "LO" line lowest order contribution to /tt. Note that the 
magenta and cyan fancy squares, which are included as a consistency check, are the only 
full QCD points that we can extrapolate to the continuum at fixed mass. We have lighter 
valence quarks on the fine lattices, and lighter valence and sea quarks on the coarse lattices. 
All such partially quenched points are included in the fits that produce the lines in the plot. 
Figure QUI comes from the fit to data subset // (Fit B). If instead we restrict the fit to data 
subset / (Fit A), the picture is virtually unchanged. 

Figures E] and |22l are the corresponding plots for meson masses {m'^p+/{mx + iTiy)). 
Figure ED is generated from the fit to data subset // (Fit B); while Fig. |221 uses data 
subset / (Fit A). Here there is a significant difference between the two plots, with the 
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latter showing much smaller higher order corrections in SU{3)l x SU{3)ii XPT than the 
former. The difference illustrates the poor control over the low energy constant 2LQ — L4 (see 
Sec. llXDHj) : We get 2^6 -L4 = 0.70(17) x lO^^ with Fit B, and 2^5-^4 = 0.24(16) x 10"=^ 
with Fit A.^^ Because this parameter multiplies 2m + m^, its effect does not vanish in the 
chiral limit of the light quark mass m. Its variation is largely canceled by differences in the 
LO parameter /i and the NNLO parameters /^g™'' and (3^^\ Eq. ()32|). so that the full NNLO 
fit line and the extrapolated vr and K values are quite close in the two fits. This means that 
the ambiguity in 2Lq — L4 and the LO term is largely irrelevant to the extraction of quark 
masses (and, indirectly, decay constants); the variation between the fits is of course included 
in systematic errors estimates of these quantities. 

The qualitative expectation from XPT is that coefficients in the expansion should be (9(1) 
when we use the dimensionless expansion parameters Xg^ Eq. ()11|) . Both Fit A and Fit B 
pass this test (the largest coefficient in either is /Jg"*^ ~ 1.66 in Fit B — see Table HV]) . so 
we must accept the large systematic effect on 2Lq — L^^ as inherent in the current data set. 
Indeed, the size of the difference in the LO term, /x, between the two fits is reasonable, given 
that the fits prefer a chiral coupling l/(167r^/^) with / moving from io ^ fx as the 
quark masses rise toward (see comments about the parameter 00 in Sec. IVI B|) . Both fits 
here fix / = /^r, but Fit B is effectively able to reduce the effect of the chiral coupling by 
reducing the LO parameter fi and compensating by increasing the NLO parameter 2LQ — L4. 
Since the difference between and f"^ is more than 40%, the ~ 20% difference between 
the /i from the two fits is not unexpected. New simulations with lighter strange sea quark 
masses will allow us to take all quark masses deeper into the chiral regime, as well as greatly 
increase the lever arm on the dependence, and should help to resolve this issue. 



5. Mass dependence of renormalization scheme 

Strictly speaking, it is incorrect to use a scale determined by a quantity like ri or the 
T'-T mass difference in chiral fits to lattice data, since such quantities themselves have some 



(small) sea quark mass dependence not included in chiral perturbation theory 



We have 



investigated this effect by changing to a mass-independent renormalization scheme: Instead 



^'^ The statistical errors here are slightly larger than those in Table llVl because the statistical errors associated 
with the continuum extrapolation are included. 
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of fixing the (relative) lattice scale for particular sea quark mass values from ri/a at those 
values, we can use, on each lattice, the value of ri/a after extrapolation to the physical 
mass values^^ using the fit Eq. (|7j). This produces the same value of a on all the coarse (or, 
separately, fine) lattices, independent of the sea quark masses. We include the difference 
between the mass independent scheme and our standard approach in our assessment of 
systematic errors. 

For decay constants and quark masses, the change in scheme is a priori unlikely to make 
much difference because the physical point is unaffected — all that may change is the ex- 
trapolation to it. Further, any low-order analytic dependence on sea quark mass introduced 
though ri would automatically be compensated by changes in the analytic chiral fit param- 
eters. So the only problem would be due to nonanalytic quark mass dependence, which is 
probably quite small because ri is a short-distance quantity, at or near the perturbative 
region. We thus consider the variation in scheme simply as another alternative version of 
the chiral fits. This means that it would affect the final systematic error only if it produced 
the largest difference from the central value over all the alternatives. In fact it is fairly small, 
as expected (see line a3 in Tables IVl and IVIjl . 

The situation is logically quite different for the low energy constants. Here, analytic 
sea quark mass dependence in ri would directly change the output values of L4 and 2Lq — 
L4, which multiply sea quark masses. We therefore consider the scheme dependence as 
a systematic error in its own right, and add any error found in quadrature with other 
systematic errors. In practice, however, this effect is still smaller than other errors (see 
Table EUj). 



6. Residual finite volume effects 

At the precision we are working (especially for fx/ fix)-, it is important to consider whether 
finite volume effects coming from terms beyond one-loop in SXPT could be non-negligible. 
Indeed, Colangelo and Haefeli 3] have recently investigated such effects in full continuum 
QCD. For volumes and masses comparable to those used here, they find large higher order 
corrections to the volume dependence, roughly 30% to 50% of the one-loop results. 



Here we consider both the quark mass values determined by method (1) and those determined by method 
(2) — see Sec. liXD3l 
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In asymptotically large volumes, the finite- volume effects in SXPT are suppressed relative 
to those in continuum XPT for the same (Goldstone) masses because most of the pions 
entering chiral loops have larger masses. This can be easily seen in Eq. (|24p : the lightest 
(Goldstone) pion appears with a weight 1/16 as large as in the continuum. However, at 
our current volumes, masses, and lattice spacings, the relation between SXPT and XPT 
finite volume effects is complicated, with the former just as likely to be larger than the 
latter as smaller. Equations and ()25|) . together with our numerical results for splittings 
(Table IIII|) and the taste- violating hairpin parameters (Eq. (jlTj), below), show how the 
asymptotic rule can be violated. Since Aa is the smallest splitting, and 5^ is non-negligible 
and negative, m^/^ may not be much larger than the Goldstone pion mass. Then, due to 
the factors of 4 in Eq. (j24|) . finite volume corrections coming from the can be as large or 
larger than the continuum corrections. As a — > 0, the term £(m^, ) would be canceled by 
£(m^^), but this cancellation may not be effective for finite volume effects at a given value 
of a, because the volume effects are sensitive to small mass differences. Since one-loop finite 
volume effects on our lattices are comparable to those in the continuum, we have no a priori 
reason to expect that our results are protected from the higher order effects j3| seen in the 
continuum. 

Assuming that the higher order finite volume effects in SXPT are roughly the same size as 
those in full continuum QCD when the volumes and the masses of the mesons in the loops 
are the same, we can estimate the resulting systematic errors. For our data, the biggest 
one-loop finite volume effects appear when both the valence masses and m' are small (giving 
a light r]'j^ in the two-flavor case, Eq. or a light tja in the three flavor case, Eq. ((221) )■ 

The worst case occurs in the coarse run with am' = 0.007; the run with m' = 0.005 has 
smaller flnite volume effects because L ^ 3.0 fm there, instead of L ^ 2.5 fm for other runs. 
FVo,„ the calcuMio. Ref. Q/« we estimate that the ..dual MgKe. o.de. fl.ite volume 
effect is at most 0.47% in /^r, 0.24% in fx, and 0.23% in fx/ fw More stringent bounds on 
the errors can be obtained by removing from the data set those points that have the largest 
finite volume corrections. Eliminating 8 of 240 points from mass subset // (5 from coarse 
run 0.007/0.05, 2 from coarse run 0.01/0.05, and 1 from fine run 0.0062/0.031), we lower the 

We are indebted to Gilberto Colangelo for providing us with the results for higher order effects in and 
fji at the values of volume and meson mass relevant to our computations 



50 



largest one-loop finite volume effect on fp+ or M|+ from 1.35% to 0.81%. Not surprisingly, 
since the reduced data set retains most of the lowest valence mass points and all of the 
lowest sea mass points, it produces nearly identical results (within 0.05%) as the original 
set. But a repeat of the analysis using Ref. Q now bounds the residual finite volume error 
by 0.29% in f^,, 0.15% in fx, and 0.14% in fx/ fn- These are negligible compared to our 
other systematic errors. In the future, however, as quark masses in staggered simulations 
decrease further, it will be necessary either to have a better handle on these higher order 
effects in SXPT or to go to significantly larger volumes. 



7. Fourth root of the determinant 

In order to eliminate the quark doubling that is still present in the staggered action, the 
simulations here take the fourth root of the quark determinant for each fiavor in order to 
reduce the quark tastes from 4 to 1 per flavor. There is apparently no ultra-local lattice 
action that would correspond to the effective action that results from taking this fourth root. 
The possibility thus exists that physical non-localities will remain in the continuum limit, 
potentially spoiling the description of QCD by the staggered action. The good agreement of 
the staggered results with experiment and with continuum chiral behavior plus understood 
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IS 



) lead us to believe that 



discretization effects (both in current and previous work 
this is not a problem, but the question is not settled. 

The comparison of simulation data with SXPT forms allows us to make a crude but 
somewhat more direct test of the fourth root trick. Equations fll7|) and ()18|) . as written, 
take into account the fourth root by dividing each sea quark loop contribution by 4, to leave 
1 taste per flavor. It is a simple exercise to generalize Eqs. (fT7j) and ()18|) to make the number 
of tastes remaining a free parameter. We can then ask what number of tastes per flavor 
is preferred by the simulation data. With a flt otherwise identical to our standard NNLO 
flt, we flnd 1.44(15) for the preferred number of tastes per flavor on data subset //, and 
1.28(12) on the lighter masses in subset /, where the errors are statistical only. If we allow 
the chiral coupling to vary also (choice (3) flts. Sec. IVIB|) . we get 1.35(18) and 1.22(14) 
on subsets // and /, respectively. In the latter case, the coefficient uj corresponds to an / 
in the chiral coupling that is about halfway between and fx (^^ ~ 0.82(11)), which is 
reasonable. Given that the fits in any case do not tightly constrain the chiral logarithm 
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actory. 

49^ and analytic j50| work indicating 



terms (see Sec. IVIB|) . we consider these results satis 

We note that there has been recent numerical 
even more directly that the fourth root trick should work as expected. On the other hand, 
there have been two other recent papers that purport to show problems with locality [51\ . 
We do not believe the latter work is worrisome because it does not take into account the 
taste structure of staggered quarks. Instead of trying to project onto a single taste to find 
the fourth root of the determinant, those papers look only at the fourth root of the Dirac 
operator itself. That procedure, in our opinion, is almost guaranteed to find a nonlocal 
result, just as it would in trying to reduce eight Wilson fermions to two, which certainly has 
an alternative, local solution. 



8. Taste violating hairpins 

Before turning to our physical results, we quote the values of the two taste-violating 
hairpin parameters coming from the fits. Together with the splittings. Table IIIH these 
parameters appear in SXPT calculations for other physical quantities, such as heavy-light 
decay constants ^]. Averaging values from Fits A and B, we find, on the coarse lattices: 

r^S'^ = -0.28(3)(5) 

rla% = -0.11(8)(^f ) , (47) 

where the errors are statistical and systematic, respectively. The latter error comes from 
the variation over all acceptable chiral fits on mass subsets / and //. The parameter a'^S'^ 
is comparable in size to the taste- violating splittings f Table UTT]) : while a'^6y is consistent 
with zero but poorly determined. The values of a^(5^ and a^Sy on the fine lattices are not fit 
separately but are constrained to be 0.35 times as large for central- value fits. (See discussion 
following Eq. 



X. FINAL RESULTS AND CONCLUSIONS 

The central values and error estimates for f^,, fx, and fx/ f-K are collected in Table El 
Central values come from Fit C (mass subset ///). The scale errors are found by repeating 
the analysis after moving our value ri = 0.317 fm by plus or minus one standard deviation 
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(±0.007 fm); see Sec. IIIII The change in decay constants under this variation in the scale 
is shghtly less than the nominal 2.2% = 0.007/0.317. This is due to a cancellation coming 
from the corresponding readjustment of the quark masses needed to give the mesons their 
physical masses. 

The (indirect) EM errors just come from changes in our results for quark masses due to 
the assumed range of A^; (see Sec. IVI1I|) : clearly this effect is very small. Direct EM effects 
that pertain to the comparison of decay constants with experiment are much larger, of order 



several percent — see Ref. j33|. However the direct effects are not relevant here because 
and Jk are defined in the absence of electromagnetism. 

The chiral /continuum errors are found by taking the maximum deviation from the central 
value over all versions of the chiral fits described in Sec llXHl and Sec. IIX iJSl and all versions 
of the continuum extrapolations described in Sec. IIX CI and Sec. IIX DTI fincluding ranges in 
assumptions about how asa"^ and a'j.a^ change from coarse to fine lattices — see discussions 
Eqs. ()26|1 and (|TO|l ). as well as variation in the perturbative parameter Rm described in 
Sec. I VI II Because the continuum and chiral extrapolations are connected within SXPT, 
it is not meaningful to quote separate errors for each. However, since a large number of 
alternatives are considered here, we believe it will be helpful to the reader to report the 
variations in physical results as one moves along various "slices" through the alternatives. 
The slices shown in Table |3 are defined as follows: 

a. All alternative chiral fits on all mass subsets, but only with the preferred method of 
continuum extrapolation (extrapolation of chiral parameters), and only with preferred 
values of the ratios of a^a^ and of a|a^ (Eqs. ()26|) and ()10|)). 

al. Same as a, but restricted to mass subset ///. This is mainly an estimate of the 
errors involved in interpolating around rris. 

a2. Same as a, but restricted to chiral fits where the chiral coupling a;/(167r^/^) is 
allowed to vary, with = 1.0 ± 0.1 (see Sec. IVI B|l . 

a3. Same as a, but restricted to fits where the scale is chosen in a mass-independent 
manner (see Sec. IIXD5jl . 

b. Alternative values of the ratios of asa'^ and of a|a^ used in continuum extrapo- 
lation and/or alternative method of extrapolation (method (2) - Sec. IIXDT| and/or 
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fn 


fK 




central value 


129.46 


156.63 


1.2099 


errors 


statistics 


0.87 


0.98 


0.0042 


scale 


+2.35 


+2.58 


+0.0027 




-2.36 


-2.51 


-0.0020 


(indirect) EM effects 


0.01 


0.10 


0.0009 


chiral / continuum 


+2.37 


+2.19 


+0.0125 


extrapolation 


-2.58 


-2.59 


-0.0112 


chiral/continuum error "slices 




a 


+0.79 


+0.60 


+0.0093 




-2.50 


-1.84 


-0.0075 


al 


+0.35 


+0.27 


+0.0075 




-1.05 


-0.42 


-0.0037 


a2 


+0.63 


+0.02 


— - 




-0.56 


-1.20 


-0.0057 


a3 






+0.0024 




-1.11 


-1.05 




b 


+2.19 


+2.02 


+0.0089 




-1.33 


-1.89 


-0.0095 


bl 


+0.12 


+0.09 


+0.0017 




-0.52 


-0.62 


-0.0019 


b2 


+0.69 


+2.00 


+0.0089 




-0.63 


-1.60 


-0.0065 


b3 


+0.39 


+0.89 


+0.0032 


c 


+0.27 


+0.26 


+0.0015 




-2.52 


-2.79 


-0.0007 



TABLE V: Central values and error estimates for /^r, /k, and JkI J-k- All errors are absolute 
amounts, not percentages. Decay constants and their errors are in MeV. Unsigned errors are taken 

as symmetric. The chiral/continuum error "slices" show variation under reduced sets of possible 
alternative fits/extrapolations; see text. 
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alternative value of Rm (Sec. IVII|) . The preferred chiral fit is kept (Fit C). 

bl. Same as b, but restricted to the preferred value (0.35) of ratio of a|a^ for all 
taste-violating quantities. 

b2. Same as b, but only the ratio of a|a^ is varied (in the range 0.3-0.4) and only 
for taste-violating quantities that are not directly measured (5^, 6y, L', and L" — 
see Sec. IVl A|) . The preferred continuum extrapolation (extrapolation of chiral fit 
parameters) is used. 

b3. Same as b, but only is varied, and the preferred continuum extrapolation is 
used. 

c. Alternative method of extrapolation (1) is used and ratio of varies over union of 
ranges of a^a^ and of a^a^ — Sec. IIX D II 

As discussed in Sec. IIXD ![ method (1) continuum extrapolation (slice c) is not included 
among our systematic alternatives because of the large ambiguity in how to perform the 
extrapolation. Table |3 shows, however, that it produces deviations comparable to the full 
chiral/continuum extrapolation error. 

We add in quadrature the signed errors from the chiral/continuum extrapolation, the 
scale determination, and from direct EM effects, giving a total positive and a total negative 
systematic error. We then take the larger of the two as a final symmetric error. Note that 
chiral extrapolation errors and scale errors contribute almost equally to the systematic error 
on and fx', while scale errors are unimportant for the ratio. The final results for decay 
constants are: 

= 129.5 ± 0.9 ± 3.5 MeV 
fx = 156.6 ± 1.0 ±3.6 MeV 
fK/U = 1.210(4)(13) , (48) 

where the first error is statistical and the second is systematic. 

In Sec. IVlBl we argued that fits that allowed the chiral coupling to vary by more than 
10% ("choice (3)" fits with arbitrary uj) should be excluded from the analysis. If we were 
to include all choice (3) fits in the systematic error analysis, the error on would increase 
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from 3.5 MeV to 4.3 MeV; that on fx/ f-K would increase from 0.013 to 0.022; while that on 
fx would be unchanged. 

Our results are in good agreement with the experimental numbers js^]: f-w = 130.7 ± 
0.4 MeV, fx = 159.8 ± 1.5 MeV, fx/fn = 1.223(12). Note that the experimental determi- 
nation of fx has a rather large error. That is because it depends not only on the precisely 
measured leptonic decay width of the kaon, but also on Vus, which has a significant un- 
certainty. The errors on our result for fx/fn are small enough that one may turn the 
comparison around, and use our answer together with the measured leptonic decay widths 
to constrain With Eq. (16) in Ref. |^, = 0.9740(5), and the current result for 

Ik/Itv, we obtain 

\Vus\ = 0.2219(26) . 

The error is completely dominated by current lattice errors, which we have added in quadra- 
ture. Neglecting |V^bp, the unitarity relation is then 

|Kd|' + |K.|' = 0.9979(15) (49) 



The 2a violation that comes from using the PDG value \Vus\ = 0.2196(26) 37[ becomes 
a 1.4(7 effect here. We note also that our result is compatible with the very recent KTeV 
determination |K.| = 0.2252(8)(21). 

ues for and fx in Eq. should be considered as updates of those presented 
The current results are based on an expanded data set. In addition, the analysis 



The ya 
in Ref. 
in Ref. 



U ■ 

17| was performed differently: The data was first extrapolated to the continuum at 
fixed quark mass and then fit to continuum XPT forms. SXPT was used only in estimating 
the systematic error of the extrapolation procedure. A correction for finite volume effects 
could not be made with the older approach; instead a finite volume error had to be included. 
The present results and those in Ref. ^3] agree within their respective systematic errors. 

Errors for our direct determination of uiu/md are shown in Table IVTl Adding the scale 
and chiral/ continuum extrapolation errors in quadrature, and symmetrizing as for the decay 
constants, we get the total simulation error. Our final result is 

m„/mrf = 0.43(0)(1)(8) , (50) 

where the errors are from statistics, simulation systematics, and direct EM effects, respec- 
tively. We have allowed for EM effects in a wide range 0<A^ = (5e<2 (see Eqs. f|I^ and 
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niu/md 


central value 


0.429 


errors 


statistics 


0.004 


scale 


0.002 


EM effects 


+0.084 




-0.076 


chir al / continuum 


+0.012 


extrapolation 


-0.006 


chir al /continuum 


error slices 


a 


+0.012 




-0.005 


al 


+0.002 




-0.005 


a2 


+0.012 




-0.004 


a3 


+0.003 


b 


+0.004 




-0.002 


bl 


+0.000 




-0.002 


b2 


+0.000 




-0.002 


b3 


+0.002 


c 






-0.002 



TABLE VI: Same as Table but for rriu/md. 

()4fi|) ). If instead we were to assume the result of Ref. [4^ (A^ = 0.84 ± 0.25), we would 
obtain rriu/md = 0.44(0)(1)(2). Including all choice (3) fits in the systematic error analysis 
would increase the simulation systematic error from 0.01 to 0.02. 

Even with the generous range of possible EM effects, Eq. ()50|) clearly bounds away 
from zero. An alternative way of expressing this is to determine the value of A^; that would 
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be required in order to allow for = 0. We find that it would take an absurdly large 
violation of Dashen's theorem, A^; ^ 8.4. 

Values for quark masses at scale 2 GeV, as well as the ratio ms/ih, were reported in 
Ref. Since that work used the same lattice data, chiral fits, and error analysis as that 
described above, we repeat the results here for completeness: 

mf^ = 76(0) (3) (7) (0) MeV , 
= 2.8(0) (1) (3) (0) MeV , 
ms/m = 27.4(1)(4)(0)(1) (51) 

where the errors are from statistics, simulation, perturbation theory, and electromagnetic 
effects, respectively. 

Combining the current result for rriu/md with the perturbative mass renormalization 



calculated in 



22j (or, equivalently, with tji^^ in Eq. (jSH)), we obtain: 



rrC^ = 1.7(0) (1) (2) (2) MeV 

= 3.9(0)(l)(4)(2)MeV, (52) 

where the errors have the same meaning as in Eq. (j3T|) . and the scale is again 2 GeV. The 
separate EM errors in m„ and rrid are highly, and negatively, correlated, and therefore 
consistent with the large EM error in 171^/171^. 

The results for mu/md and rris/rh in Eqs. (|50p and (|51|) appear inconsistent with the 
relation between ms/md and rriu/md shown in Fig. 1 of Ref. jj]. However, that appears to 
be due to NNLO effects not included in j^. Indeed, Amoros et al. 4^ obtain rriu/md = 
0.46(9) with a NNLO phenomenological analysis. Further, our results for the two ratios are 
consistent with the NNLO relation shown in Fig. 3 of Ref. j^]. 

Since is bounded well away from 0, the issue of the physicality of m„ = ^6] does 
not arise directly here. However, should the existence of non-perturbative, additive shifts 
in masses proposed in Ref. Q be confirmed, there could be some lattice scheme dependence 
in the quark masses and ratios in Eqs. ()5H) and ()52|) . We would expect that such non- 
perturbative effects at the scale of the cutoff would be small at the mass values found 
here, but there is no proof of this. Comparison with three-flavor results with other lattice 
regularizations will be important in resolving this question. 
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Table rVTll shows the systematic errors for the Gasser-Leutwyler low energy constants, Lj. 
Central values are obtained from averaging the results of Fit A and Fit B (on mass sets / 
and // respectively); those results are repeated here for convenience from Table HVl The 
difference between these fit results and the central value is the largest contribution to the 
chiral/continuum extrapolation error for 2L^ — L5 and 2Lq — L4. As discussed in Sees. IVIBI 
and lIXD"5l we include two additional systematic errors here, to be added in quadrature with 
the scale and chiral/continuum extrapolation errors: the NNLO error caused by taking fi — »• 
Utree the NLO terms, and the effect of using a slightly mass- dependent renormalization 
scheme. 

The chiral/continuum "error slices" in Table rVTTl have the same meaning as for the decay 
constants, except that al and c no longer apply. (Slice al shows differences with mass set 
///, which is not included in this part of the analysis, and slice c is not relevant since these 
quantities are themselves fit parameters.) Further, slice a3, the effects of the mass-dependent 
scheme, has now been promoted to a separate error. 

After adding the systematic errors in quadrature and symmetrizing as before, we obtain: 

L5 = 1.9(3)(3) X 10"^ 

L4 = 0.2(3)(3) X 10-^ 

2L8-L5 = -0.2(1)(2) X 10"^ 

2Lq-U = 0.5(2)(4) X 10"^ , (53) 

Systematic errors here are dominated by differences over acceptable fits. The chiral scale is 
taken as A^ = throughout. Including all choice (3) fits in the systematic error analysis 
would not change the errors. 

Reference jS^ makes the following continuum estimates: L5 = 2.3(2)xl0~^, L4 ^ Lg ~ 0; 
while Ref. Q gives L5 = 2.2(5) xlO-^ L4 = 0.0(5) xlQ-^ and Lq = 0.0(3) xlO^^ (which they 
call "conventional estimates.") Here we have converted all the Lj to A^ = scale using 
Eqs. dnni) and dSH). 



The result for 2Ls — L5 is well outside the range that would allow for m„ = 
the context of XPT: 



m 



-3.4 X 10"^ < 2L8-L5 < - 1.8 X 10"^ . (54) 
We note, however, that the constraint on m„ coming from 2L8 — L5 is not independent 
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L5 


L4 


2Ls — L5 


2Lg — 2^4 


central value 


1.89 


0.19 


-0.18 


0.47 


errors 


statistics 


0.28 


0.29 


0.11 


0.16 


scale 


+0.01 


+0.06 


+0.03 


+0.03 




-0.00 


-0.05 


-0.03 


-0.03 


/^trcc 


0.13 


0.01 


0.01 


0.03 


mass dependent 


+0.06 


+0.14 


+0.03 




scheme 








-0.01 


chir al / continuum 


+0.24 


+0.21 


+0.15 


+0.38 


extrapolation 


-0.15 


-0.19 


-0.20 


-0.31 


chir al/ continuum error slices 


Fit A 


1.83 


0.18 


-0.04 


0.24 


Fit B 


1.95 


0.20 


-0.33 


0.70 


a 


+0.21 


+0.19 


+0.15 


+0.33 




-0.14 


-0.17 


-0.18 


-0.27 


a2 


+0.03 


+0.11 


+0.02 


+0.07 




-0.07 


-0.06 


-0.02 


-0.04 


b 


+0.06 


+0.10 


+0.03 


+0.01 




-0.04 


-0.09 


-0.02 


-0.04 


bl 


+0.00 


+0.00 


+0.00 


+0.00 




-0.00 


-0.00 


-0.01 


-0.00 


b2 


+0.06 


+0.07 




+0.01 




-0.03 


-0.09 


-0.02 


-0.02 


b3 


+0.00 


+0.10 


-0.02 


+0.00 



TABLE VII: Central values and error estimates for Li (multiplied by 10^) at chiral scale = m^. 
We show differences from the central values everywhere except for the lines marked Fit A and Fit 
B, where we give the results from those fits. See text for explanations of the various "error slices." 

from the direct determination above. Knowing 2Ls — L5 would fix ruu in NLO up to EM 
effects. The range in Eq. ()54j) comes from unknown NNLO (and EM) terms. Since our fits 
give us some control over NNLO effects, the direct determination seems preferable, and can 
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become quite precise if one uses more information on EM effects. This information may 

ri 

come from phenomenology, e.g., Ref. [4^], or from lattice simulations, perhaps along the 



lines of Refs. 



55| or [56 1 . 



Our approach to computing low energy constants has much in common with earlier work 
by Nelson, Fleming, and Kilcup |7| , who also performed a partially quenched analysis using 
3 flavors of dynamical staggered quarks. The main advances in the current analysis are: (1) 
use of the improved dynamical staggered action and finer lattice spacings, putting us closer 
to continuum physics, and (2) use of SXPT to control lattice artifacts, which are still quite 
large, despite (1). Our result for 2Ls — is marginally consistent with that by Nelson et 
al. [7|; converting their result to chiral scale m^, we get 2L^ — = — 0.57(1)(14) x 10^^. 

The current work will be improved by additional simulations now in progress, including 
coarse lattices at lower strange quark mass [am'^ = 0.03) and fine lattices at lower light quark 
mass (am' = O.lam'^ = .0031). These simulations should enhance our control of the chiral 
extrapolation, the interpolation around the s quark mass, and the extraction of low energy 
constants. In addition, we are beginning a parallel analysis on a large quenched data set. If 
the corresponding SXPT forms can describe that data well, it will increase our confidence 
that the interaction of discretization and chiral effects is understood. Beyond that, planned 
simulations at still finer lattice spacings will provide a better handle on both generic and 
taste- violating discretization errors, thereby significantly reducing the final systematic errors. 
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FIG. 1: Pion masses with random- wall and Coulomb- wall sources and point and Coulomb- wall 
sinks from the coarse set with sea quark lattice masses 0.01,0.05 (see Table DJ. The red crosses are 
random-wall source and Coulomb-wall sink, and the green octagons are Coulomb-wall source and 
point sink (summed over spatial sites to project out the zero momentum states). The blue bursts 
are from a random-wall source and point sink, and the magenta squares have a Coulomb-wall 
source and sink. The lower set of "WW" points include an excited state in the fit. The symbol size 
is proportional to the confidence level of the fit, with the symbol size in the labels corresponding 
to 50%. 
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FIG. 2: Same as Fig. ^but for pion propagator amplitudes. The lower set of "WW" points again 
include an excited state in the fit. The "PW" symbols have been displaced slightly to the right to 
separate them from the "WP" points. 
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FIG. 3: Ratio of pion propagators. Here Pwp is the Coulomb wall source and point sink pion 
propagator, etc. The point source was implemented with a random wall as discussed in the text. 
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FIG. 4: Pion masses (red octagons) and amplitudes (blue crosses) as a function of the minimum 
time distance in the fit, from the fine set with sea quark lattice masses 0.0062,0.031 (see Table QJ. 
The amplitudes have been multiplied by 175. 
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with rux = my] while the second shows "kaon" points with ruy = m'^. Bursts are pion points with 
valence masses equal to sea quark masses. 
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FIG. 8: Same as Fig. [3 but for decay constants. 
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FIG. 9: Squared masses of charged pions for various tastes on the coarse lattices. We use ri to set 
the scale. Tastes that are degenerate by 50(4) symmetry are fit together. 
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FIG. 10: Fit of partially quenched data to continuum form. Data for both and m'^/{mx + my) 
with various vrix and niy values are included in the fit, but only /j^ points with rux = niy are shown. 



74 



0.22 



C!L = 0.65; 0.39 

<> coar$e 
0.20 I— □ fine 

K full, a extrap 



0.18 — 



0.16 — 



0.14 — 




0.00 



O 0.03, 0.05 
O 0.02, 0.05 
o 0.01, 0.05 
o 0.007, 0.05 
O 0.005, 0.05— 

□ 0.0124, 0.031 

□ 0.0062, 0.031 

* extrap; I^y^tematic err 
E3 experiment — 
■full, cont., m'g full, cont., 



0.05 



0.10 



FIG. 11: "Pion" decay constants with rrix = ruy vs. quark mass, in units of ri. The relative 
mass renormalization of coarse and fine lattices has been included so that data from both may 
be presented on the same plot. Lines come from "Fit B," a single NNLO fit, Eqs. (|32|) and ()33() . 
to entire data subset // (decay constants and masses). The cyan solid line and red dotted line 
represent the fit function in "full QCD" (valence and sea masses set equal) after extrapolation of 
parameters to the continuum. The cyan solid line keeps the s quark mass equal to m'g on the fine 
lattices; while the red dotted line (just barely visible above the cyan solid line) replaces m'g with 
the physical mass m^. The cyan fancy squares result from extrapolation of full QCD points to the 
continuum at fixed quark mass; their agreement with the cyan solid line is a consistency check. 
Points and fit lines have been corrected for finite volume effects. 
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FIG. 12: Same as Fig. ^2 (Fit B), but squared "pion" masses divided by quark mass are shown. 
Because taste splittings are smaller for the fine lattices, the average meson mass changes more 
rapidly with quark mass, and there is greater curvature at small quark mass. 
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FIG. 13: "Pion" decay constants with rux = my in data subset ///. Lines come from Fit C, a 
single NNNLO fit to masses and decay constants. The cyan sohd hne, red dotted hne, and cyan 
fancy squares have the same meaning as in Fig. 1111 Points (and fit lines) have been corrected for 
finite volume effects. 
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FIG. 14: Similar to Fig. El (Fit C), but for "kaon" decay constants, with niy fixed as closely as 
possible on each set to m^^^^ . The red dotted line has parameters extrapolated to the continuum, 
the strange valence and sea masses set to the physical value, and light sea quark mass equal to 
light valence mass tjIx- The green dotted extension has the light sea quark mass fixed at m, and 
the valence mass rux continuing down to (thus giving //^+). 
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FIG. 16: Squared meson masses in subset ///, as a function of mx/m'g. The lines are from Fit C. 
We show results from two lattices: a coarse lattice with sea quark masses am! = 0.01, am'^ = 0.05, 
and a fine lattice with am! = 0.0062, am'^ = 0.031. Three sets of "kaon" points with niy = 
rrig, 0.8m^, 0.6m^, are plotted for each lattice. "Pion" points have nix = my The statistical errors 
in the points are not visible on this scale. The red dashed lines give the continuum-extrapolated 
fit (now as a function of mx/ms), and the magenta vertical dotted line shows the physical m/ms 
obtained. The extension (shown in green) of the red dashed kaon line until it intersects the QCD 
value then gives mu/ms, from which we find mu/ih or mu/md- 
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81 



— I 1 1 1 1 1 1 1 r 

180 I — o X f jj (w/ taste viols) 



> 
CD 



160 — 



140 



120 — 



o X f jj (w/ taste viols; a^a^ extrap)- 
□ + fjj f„ (taste viols->0) 
o * f extrap params ^ - 



Q 
CD 



X 



J I I L 



0.0 0.5 

(cxga^)/(aga^) 



S /coarse 



X 



J I I L 



1.0 



FIG. 18: Dependence of chirally-extrapolated /jr and fx on lattice spacing. The blue octagons 
and crosses show values at fixed lattice spacing (using "method (1)") and those extrapolated to the 
continuum linearly in aa^. For the green octagons and crosses, the extrapolation is linear in a^a^. 
The magenta squares and pluses have the taste-violating effects at fixed lattice spacing removed 
with SXPT ("method (2)"); the points are then extrapolated to the continuum linearly in aa^. 
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is used everywhere. 
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FIG. 20: Convergence of SU{3)l x SU{3)r XPT for decay constants. Our results on mass set // 
for and fx at LO (dash-dotted black line), NLO (dotted red and blue curves), and NNLO (solid 
red and blue curves) are shown. The chiral parameters have been extrapolated to the continuum. 
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cyan fancy squares are found by extrapolating our full QCD data to the continuum limit at fixed 
quark mass. All results come from Fit B. 
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85 



Nl 


1 1 1 1 


I I 


1 1 1 1 1 1 


c 

Nl 

X 


chiral 
limit 


NLO 7T ... 


... NLO K - 






- 


+ 

X 


LO 


NNLO 7T ^ 












6 






- 




1 1 1 1 


1 1 1 1 1 1 1 


1 1 1 1 1 1 



0.00 0.05 0.10 0.15 

FIG. 22: Same as Fig. EH but for Fit A (data subset /). 



86 



